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TSO101-GAG INTERACTION AND USE THEREOF 
Technical Held 

The present invention generally relates to protein-protein interactions, and use 
5 thereof in drug screen and disease treatment 

Background of the Invention 
In contrast with the traditional view of protein function, which focuses on the 
action of a single protein molecule, a modem expanded view of protein function defines a 

10 protein as an element in an interaction network. See Eiseriberg et aL, Nature, 405:823- 
826 (2000). That is, a full understanding of the functions of a protein will require 
knowledge of not only the characteristics of the protein itself, but also its interactions or 
connections with other proteins in the same interacting network. In essence, protein- 
protein interactions form the basis of almost all biological processes, and each biological 

15 process is composed of a network of interacting proteins. For example, cellular 
structures such as cytoskeletons, nuclear pores, centrosomes, and kinetochores are 
formed by complex interactions among a multitude of proteins. Many enzymatic 
reactions ore associated with large protein complexes fanned by interactions among 
enzymes, protein substrates, and protein modulators- In addition, protein-protein 

20 interactions are also part of the mechanisms for signal transduction and other basic 

cellular functions such as DNA replication, transcription, and translation. For example, 
the complex transcription initiation process generally requires protein-protein interactions 



1 



WO 02/072790 



PCT/US02/08146 



among numerous transcription factors, RNA polymerase, and other proteins. See e.g., 
Tjian and Maniatis, Cell, 77:5-8 (1994). 

Because most proteins function through their interactions with other proteins, if a 
test protein interacts with a known protein, one can reasonably predict that the test 
5 protein is associated with the functions of the known protein, e.g., in the same cellular 
structure or same cellular process as the known protein. Thus, interaction partners can 
provide an immediate and reliable understanding towards the functions of the interacting 
proteins. By identifying interacting proteins, a better understanding of disease pathways 
and the cellular processes that result in diseases may be achieved, and important 

10 regulators and potential drag targets in disease pathways can be identified. 

There has been much interest in protein-protein interactions in the field of 
proteomics. A number of biochemical approaches have been used to identify interacting 
proteins. These approaches generally employ the affinities between interacting proteins 
to isolate proteins in a bound state. Examples of such methods include 

15 coimmunoprecipitation and copurification, optionally combined with cross-linking to 
stabilize the binding. Identities of the isolated protein interacting partners can be 
characterized by, e.g., mass spectrometry. See e.g., Rout et aL, J. Cell Biol, 148:635- 
651 (2000); Houry et aL w Nature, 402:147-154 (1999); Winter et al. a Curr. Biol, 7:517- 
529 (1997). A popular approach useful in large-scale screening is the phage display 

20 method, in which filamentous bacteriophage particles are made by recombinant DNA 

technologies to express a peptide or protein of interest fused to a capsid or coat protein of 
the bacteriophage. A whole library of peptides or proteins of interest can be expressed 
and a bait protein can be used to screening the library to identify peptides or proteins 
capable of binding to the bait protein. See e.g., U.S. Patent Nos. 5,223,409; 5,403,484; 

25 5,571,698; and 5,837,500. Notably, the phage display method only identifies those 
proteins capable of interacting in an in vitro environment, while the 
coimmunoprecipitation and copurification methods are not amenable to high throughput 
screening. 

The yeast two-hybrid system is a genetic method that overcomes certain 
30 shortcomings of the above approaches. The yeast two-hybrid system has proven to be a 
powerful method for the discovery of specific protein interactions in vivo. See generally, 
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B artel and Fields, eds., The Yeast Two-Hybrid System, Oxford University Press, New 
York, NY, 1997. Hie yeast two-hybrid technique is based on the fact that the DNA- 
binding domain and the transcriptional activation domain of a transcriptional activator 
contained in different fusion proteins can still activate gene transcription when they are 
5 brought into proximity to each other. In a yeast two-hybrid system, two fusion proteins 
are expressed in yeast cells. One has a DNA-binding domain of a transcriptional 
activator fused to a test protein. The other, on the other hand, includes a transcriptional 
activating domain of the transcriptional activator fused to another test protein. If the two 
test proteins interact with each other in vivo, the two domains of the transcriptional 
10 activator are brought together reconstituting the transcriptional activator and activating a 
reporter gene controUed by the transcriptional activator. See, e.g., U.S. Patent No. 
5,283,173. 

Because of its simplicity, efficiency and reliability, the yeast two-hybrid system 

* 

has gained tremendous popularity in many areas of research. In addition, yeast cells are 

IS eukaryotic cells. The interactions between mammalian proteins detected in the yeast 
two-hybrid system typically are bona fide interactions that occur in mammalian cells 
under physiological conditions. As a matter of fact, numerous mammalian protein- 
protein interactions have been identified using the yeast two-hybrid system. Hie 
identified proteins have contributed significantly to the understanding of many signal 

20 transduction pathways and other biological processes. For example, the yeast two-hybrid 
system has been successfully employed in identifying a large number of novel 
mammalian cell cycle regulators that are important in complex cell cycle regulations. 
Using known proteins that are important in cell cycle regulation as baits, other proteins 
involved in cell cycle control were identified by virtue of their ability to interact with the 

25 baits. See generally, Hannon et al, in The Yeast Two-Hybrid System, B artel and Fields, 
eds., pages 183-196, Oxford University Press, New York, NY, 1997. Examples of 
mammalian cell cycle regulators identified by the yeast two-hybrid system include 
CDK4/CDK6 inhibitors (e.g., pl6, pl5, pl8 andpl9), Rb family members (e.g., pl30), 
Rb phosphatase (e.g., PPl-a2), Rb-binding transcription factors (e.g., E2F-4 and E2F-5), 

30 General CDK inhibitors (e.g., p21 and p27), CAK cyclin (e.g., cyclin H), and CDK 
Hirl61 phosphatase (e.g„KAP and CDI1). See id at page 192. "[TJhe two-hybrid 
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approach promises to be a useful tool in our ongoing quest for new pieces of the cell 
cycle puzzle.'* See id at page 193. 

The yeast two-hybrid system can be employed to identify proteins that interact 
with a specific known protein involved in a disease pathway, and thus provide valuable 
5 understandings of die disease mechanism. Hie identified proteins and the protein-protein 
interactions they participate are potential drag targets for use in selecting new drugs for 
treating the disease. 

Human immunodeficiency virus (HIV) infection causes the acquired 
immunodeficiency syndrome (commonly known as AIDS). HTV is a retrovirus that 

10 primarily infects T cells expressing the CD4 glycoprotein, Le„ CD4 + T-cells, which are 
also known as helper T-cells, HIV virus multiplies in helper T-cells and quickly 
destroys the host helper T-cells, resulting in cellular immunity depression and leaving the 
infected patient susceptible to opportunistic infections, malignancies and various other 
pathological conditions. Ultimately, HTV infection can cause depletion of helper T-cells 

IS and collapse of a patient's immune defenses. Not surprisingly, HIV-infected individuals 
and AIDS patients typically develop AJDS-related conditions such as AIDS-ielated 
complex (ARC), progressive generalized lymphadenopathy (PGL), dementia, tropical 
paraparesis, Kaposi's sarcoma, thrombocytopenia purpurea, herpes infection, 
cytomegalovirus infection, Epatein-Barr virus related lymphomas among others. la any 

20 case, the HIV viruses in an infected individual are infectious and can be transmitted to 
other people through blood transfusion or sexual contacts. 

There has been a great deal of effort in the past fifteen years or so in developing 
methods and pharmaceutical compounds for treating HIV infection and AIDS. The 
therapeutic approaches have been mostly focused on a limited number of drug targets, 

25 namely HTV reverse transcriptase, HIV protease, and HIV integrate. A number of 

reverse transcriptase inhibitors and protease inhibitors have been developed or marketed. 
Examples of nucleoside reverse transcriptase inhibitors include Zidovudine, Stavudine, 
Lamivudine, and ddl. Examples of non-nucleoside reverse transcriptase inhibitors 
include Bf avirenz, Delavirdine, and Abacavir. In addition, a number of HIV protease 

30 inhibitors are commercially available including Ritonavir, Nelfinavir, Indinavir and 
Saquinavir. 
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However, HIV typically undergoes active mutations as it multiplies. In addition, 
there are extensive genetic variations in HIV partly due to high mutation rate. Therefore, 
mutations in HIV reverse transcriptase and protease arise frequently in infected 
individuals and render the virus resistant to the inhibitor administered to patients. 
5 Combination therapy, generally referred to as HAART (highly active anti-retroviral 
therapy), has been developed in which a combination of different anti-HTV inhibitors is 
administered to a patient. However, viral resistance to combination therapies still 
frequently develops. 

In addition, many of the anti-HTV compounds known in the art have other serious 
10 drawbacks. For example, the reverse transcriptase inhibitors such as AZT and ddl are 
fairly toxic and cause serious side effects in patients treated with such compounds. 
Therefore, although limited success for controlling HTV infection and AIDS has been 
achieved with previously developed anti-HTV compounds, there is. a need for alternative 
therapeutic approaches that overcome the shortcomings of currently available drugs. 

15 

Summary of the Invent^ 
It has been discovered that human tumor susceptibility, gene 101 ("TsglOl ") 
interacts with HIV GAOp6. Particularly, the interaction between TsglOl and HTV GAG 
is essential for HTV budding from host cells. Thus, the protein complexes formed by 

20 TsglOl and HIV GAG or GAGp6, as well as TsglOl alone, can be used in screening 

assays to select compounds capable of modulating the functions and activities of TsglOl 
and the protein complexes containing TsglOl and HIV GAG or GAGp6. The identified 
compounds can be useful in inhibiting lend virus propagation, particularly HIV 
propagation, and in treating HIV infection and AIDS. 

25 Accordingly, in accordance with a first aspect of the present invention, an isolated 

protein complex is provided having a first protein which is TsglOl or a homologue or 
derivative or fragment thereof interacting with a second protein which is HTV GAG 
polypeptide or a homologue or derivative or fragment thereof. In a preferred 
embodiment, the first protein is a fusion protein containing (a) TsglOl or (b) a TsglOl 

30 homologue or (c) a TsglOl fragment. In another prefeued embodiment, the second 
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protein can be a fusion protein containing (a) HIV GAO polypeptide or (b) an HIV OAO 
homoiogue or (c) an HIV OAO fragment 

In another aspect of die invention, an isolated protein complex is provided having 
a first protein which is TsglOl or a homoiogue or derivative or fragment thereof 
5 interacting with a second protein which is HTV GAGp6 or a homoiogue or derivative or 
fragment thereof . In a preferred embodiment, the first protein is a fusion protein 
containing (a) TsglOl or (b) a TsglOl homoiogue or (c) a TsglOl fragment In another 
preferred embodiment, the second protein is a fusion protein containing (a) HIV GAGp6 
polypeptide or (b) an HIV GAGp6 homoiogue or (c) an HIV GAGp6 fragment 

10 The present invention also provides an isolated protein complex comprising: (a) a 

first protein which is selected from group consisting of (i) TsglOl protein, (ii) a TsglOl 
protein homoiogue having an amino acid sequence at least 90% identical to that of 
TsglOl and capable of interacting with HIV GAGp6, (iii) a TsglOl protein fragment 
containing the TsglOl UEV domain, and (iv) a fusion protein containing said TsglOl 

15 protein, said TsglOl protein homoiogue or said TsglOl protein fragment; and (b) a 

second protein selected from the group consisting of (1) HTV OAO polypeptide, (2) an 
HIV GAO polypeptide homoiogue having an amino acid sequence at least 90% identical 
to that of HIV GAO polypeptide and capable of interacting with TsglOl , (3) HIV 
GAGpS, (4) an HIV GAGp6 homoiogue having an amino acid sequence at least 90% 

20 identical to that of HIV GAGp6 polypeptide and capable of interacting with TsglOl, (5) 
an HIV OAOp6 fragment capable of interacting with TsglOl, and (6) a fusion protein 
containing said HTV GAG polypeptide, said HIV GAG polypeptide homoiogue, said HIV 
GAGp6 protein, said HIV GAGp6 homoiogue or said HIV GAGp6 fragment In specific 
embodiments, the HIV G AGp6 fragment has a contiguous span of at least 10 amino acid 

25 residues of a naturally occurring HTV GAGp6, said contiguous span containing a 

P(T/S)AP late domain motif. Preferably, the HIV GAGp6 fragment contains an amino 
acid sequence selected from the group of SEQ ID NOs: 25-32. 

The present invention further provides an isolated protein complex comprising a 
first protein which is TsglOl or a homoiogue or derivative or fragment thereof interacting 

30 with a second protein which is a retrovirus GAG polypeptide containing a P(T/S)AP late 
domain motif or a homoiogue or derivative or fragment of said retrovirus GAG 
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polypeptide. Preferably, the retrovirus is a lentivirus. More preferably, the retrovirus is a 
primate lentivirus such as a virus selected from the group consisting of HTV-1, HIV-2, 
HTV-3, and simian immunodeficiency viruses. The lentivirus can also be a non-primate 
lentivirus selected from the group consisting of bovine lenti viruses, feline lentiviruses, 
5 aod ovine/caprine lentiviruses. Thus, an isolated protein complex can include: 

(a) a first protein which is selected from group consisting of 

(i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
90% identical to that of TsglOl and capable of interacting with HIV GAGp6, 

10 (iii) a TsglOl protein fragment containing the TsglOl UEV domain, and 

(iv) a fusion protein containing said TsglOl protein, said TsglOl proton 
homologue or said TsglOl protein fragment; and 

(b) a second protein selected from the group consisting of 

(1) a retrovirus GAG polypeptide having the P(T/S)AP late 

15 domain motif, 

(2) a homologue of said retrovirus GAG polypeptide, said homologue 
having an amino acid sequence at least 90% identical to that of said retrovirus GAG 
polypeptide and capable of interacting with TsglOl, 

(3) a fragment of said retrovirus GAG polypeptide, said fragment being 
20 capable of interacting with TsglOl, and 

(4) a fusion protein containing said retrovirus GAG polypeptide, said 
retrovirus GAG polypeptide homologue or said retrovirus GAG polypeptide fragment 

In another specific embodiment, an isolated protein complex can include: 

(a) a first protein which is selected from group consisting of 
25 (i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
90% identical to that of TsglOl and capable of interacting with HTV GAGp6, 

(iii) a TsglOl protein fragment containing the TsglOl UEV domain, and 

(iv) a fusion protein containing said TsglOl protein, said TsglOl protein 
30 homologue or said TsglOl protein fragment; and 

(b) a second protein selected from the group consisting of 
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(1) a primate lenti virus GAG polypeptide, 

(2) a primate lenti virus GAG polypeptide hamologue having an amino 
acid sequence at least 90% identical to that of said primate lentivirus GAG polypeptide 
and capable of interacting with TsglOl, 

5 (3) a primate lentiviros GAGp6 protein, 

(4) a primate lenti virus GAGp6 homologue having an amino acid 
sequence at least 90% identical to that of HIV GAGp6 polypeptide and capable of 
interacting with TsglOl, 

(5) a primate lenti virus GAGp6 fragment capable of interacting with 

10 TsglOl, and 

(6) a fusion protein containing said primate lentiviros GAG polypeptide, 
said primate lentiviros GAG polypeptide homologue, said primate lentivirus GAGp6 
protein, said primate lentivirus GAGp6 homologue or said primate lentivirus GAGp6 
fragment 

15 In yet another aspect, the present invention also provides a method for making the 

protein complexes of the present invention. The method includes the steps of providing 
the first protein and the second protein in the protein complexes of the present invention 
and contacting said first protein with said second protein. In addition, the protein 
complexes can be prepared by isolation or purification from tissues and cells or produced 

20 by recombinant expression of their protein members. 

The present invention further relates to a fusion protein having a first polypeptide 
covalently linked to a second polypeptide, wherein said first polypeptide is TsglOl or a 
homologue or fragment thereof, and wherein said second polypeptide is HIV GAG or a 
homologue or fragment thereof. Isolated nucleic acids encoding the fusion protein are 

25 also provided. 

In yet another aspect of the present invention, antibodies immonoreactive with the 
protein complexes of the present invention are provided. In one embodiment, an 
antibody is selectively immunoreacti ve with a protein complex of the present invention. 
In another embodiment, a bifunctional antibody is provided which has two different 
30 antigen binding sites, each being specific to a different interacting protein member in a 
protein complex of die present invention. The antibodies of the present invention can 
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take various forms including polyclonal antibodies, monoclonal antibodies, chimeric 
antibodies, antibody fragments such as Fv fragments, single-chain Fv fragments (scFv), 
Fab 1 fragments, and F<ab 1 ) 2 fragments. Preferably, the antibodies are partially or fully 
humanized antibodies. The antibodies of the present invention can be readily prepared 
5 using procedures generally known in the art For example, recombinant libraries such as 
phage display libraries and ribosome display libraries may be used to screen for 
antibodies with desirable specificities. In addition, various mutagenesis techniques such 
as site-directed mutagenesis and PGR diversification may be used in combination with 
the screening assays. 

10 The present invention further relates to protein microchips comprising one or 

more of the protein complexes of the present invention and/or one or more antibodies 
according to the present invention. Such protein microchips can be useful in large-scale 
high throughput screening assays involving the protein complexes. 

In yet another aspect of the present invention, screening methods are provided for 

IS selecting modulators of a protein complex formed between TsglOl or a homologue, 
derivative or fragment thereof and HIV GAG or a homologue, derivative or fragment 
thereof. Screening methods are also provided for selecting modulators of TsglOl. The 
compounds identified in the screening methods of the present invention can be used in 
studying the interaction between TsglOl and HIV GAG and understanding the 

20 mechanism ofHTV viral propagation. The selected compounds may also be useful in 
preventing or ameliorating diseases or disorders such as viral infection, particularly HIV 
infection and AIDS. 

Thus, test compounds may be screened in an in vitro binding assay to select 
compounds capable of binding a protein complex of the present invention or TsglOl . In 

25 addition, in vitro dissociation assays may also be employed to select compounds capable 
of dissociating the protein complexes identified in accordance with the present invention. 
An in vitro screening assay may also be used to select compounds that trigger or initiate 
the formation of, or stabilize* a protein complex of the present invention. In preferred 
embodiments, in vivo assays such as yeast two-hybrid assays and various derivatives 

30 thereof, preferably reverse two-hybrid assays, are utilized in selecting compounds that 
interfere with or disrupt protein-protein interactions between TsglOl or a homologue, 
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derivative or fragment thereof and HIV GAGp6 or a homologue, derivative or fragment 
thereof. In addition, systems such as yeast two-hybrid assays are also useful in selecting 
compounds capable of triggering or initiating, enhancing or stabilizing protein-protein 
interactions between TsglOl or a homologue, derivative or fragment thereof and HIV 
S OAGp6 or a homologue, derivative or fragment thereof. 

Thus, in one embodiment, the screening method includes the steps of contacting 
the first and second proteins of the protein complexes of the present invention in the 
presence of one or more test compounds, and detecting the interaction between said first 
protein and said second protein. Preferably, the first and second proteins ate fusion 

10 proteins having a detectable tag. The methods can be conducted in a substantially cell 
free environment or in a host cell, preferably in yeast cells. 

In a preferred embodiment, the screening method includes: (a) providing in a host 
cell a first fusion protein having TsglOl or a homologue or derivative or fragment 
thereof, and a second fusion protein having a retroviral GAG (e.g. HIV GAG) or 

IS homologue or derivative or fragment thereof, wherein a DNA binding domain is fused to 
one of the first and second proteins while a transcription-activating domain is fused to the 
other of said first and second proteins; (b) providing in said host cell a reporter gene, 
wherein the transcription of the reporter gene is determined by the interaction between 
the first protein and the second protein; (c) allowing said first and second fusion proteins 

20 to interact with each other within said host cell in the presence of a test compound; and 
(d) determining the presence or absence of expression of said reporter gene. 

In accordance with another aspect of the present invention, a host cell is provided 
comprising: 

(a) a first expression cassette having a promoter operably linked to a nucleic acid 
25 encoding a first protein, said first protein being selected from group consisting of 

(i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
90% identical to that of TsglOl and capable of interacting with HTV GAGp6, 

(iii) a TsglOl protein fragment containing the TsglOl UEV domain, and 
30 (iv) a fusion protein containing said TsglOl protein, said TsglOl protein 

homologue or said TsglOl protein fragment; and 

10 
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(b) a second expression cassette having a promoter operably linked to a nucleic 
acid encoding a second protein selected from the group consisting of 

(1) aretrovirus GAG polypeptide having the P(T/S)AP late 

domain motif , 

(2) a homologue of said retrovirus GAG polypeptide, said homologue 
having an amino acid sequence at least 90% identical to that of said retrovirus GAG 
polypeptide and capable of interacting with TsglOl, 

(3) a fragment of said retrovirus GAG polypeptide, said fragment being 
capable of interacting with TfcglOl, and 

(4) a fusion protein containing said retrovirus GAG polypeptide, said 
retrovirus GAG polypeptide homologue or said retrovirus GAG polypeptide 

fragment. 

In a preferred embodiment, the second protein is selected from the group 
consisting of 

(1) HTV GAG polypeptide, 

(2) a HTV GAG polypeptide homologue having an amino acid sequence at 
least 90% identical to that of HTV GAG polypeptide and capable of interacting with 
TsglOl, 

(3) HIV GAGp6 protein, 

(4) a HIV GAGp6 homologue having an amino acid sequence at least 90% 
identical to that of HTV GAGp6 polypeptide and capable of interacting with TsglOl, 

(5) a HIV GAGp6 fragment capable of interacting with TsglOl, and 

(6) a fusion protein containing said HIV GAG polypeptide, said HIV GAG 
polypeptide homologue, said HTV GAGp6 protein, said HIV GAGp6 homologue or said 
HTV GAGp6 fragment 

In another preferred embodiment of the host cell of the present invention, one of 
said first and second nucleic acids is linked to a nucleic acid encoding a DNA binding 
domain, and the other of said first and second nucleic adds is linked to a nucleic acid 
encoding a transcription-activation domain, whereby two fusion proteins can be produced 
in said host cell. Preferably, the host cell further comprises a reporter gene, wherein the 
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expression of the reporter gene is determined by the interaction between the first protein 
and the second protein. 

The present invention further relates to a method for providing a compound 
capable of interfering with an interaction between the first and second proteins in the 
5 protein complexes of the present invention, which comprises the steps of providing 
atomic coordinates defining a three-dimensional structure of a protein complex, and 
designing or selecting compounds capable of interfering with the interaction between said 
first protein and said second protein based on said atomic coordinates. 

In addition, the present invention also provides a method for selecting a 
10 compound capable of inhibiting a protein-protein interaction between TsglOl and HTV 
GAGp6, which comprises: 

contacting a test compound with a protein selected from group consisting of 

(i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
15 90% identical to that of TsglOl and capable of interacting with HIV GAOp6, 

(iii) a TsglOl protein fragment containing the TsglOl UEV domain, and 

(iv) a fusion protein containing said TsglOl protein, said TsglOl protein 
homologue or said TsglOl protein fragment; and 

determining whether said test compound is capable of binding said protein. In a 
20 preferred embodiment, the method further includes testing a selected test compound 

capable of binding said protein for its ability to interfere with a protein-protein interaction 
between TsglOl and HIV GAGp6, and optionally further testing a selected test 
compound capable of binding said protein for its ability to inhibit HIV viral budding from 
an HIV-infected host cell. 
25 The present invention further provides a method for selecting a compound 

capable of inhibiting a protein-protein interaction between TsglOl and a retroviral GAG 
containing a P(T/S)AP motif. The method comprises (a) providing atomic coordinates 
defining a three-dimensional structure of a protein selected from group consisting of (i) 
TsglOl protein, (ii) a TsglOl protein homologue having an amino acid sequence at least 
30 90% identical to that of TsglOl and capable of interacting with HIV GAGp6, (iii) a 

TsglOl protein fragment containing the TsglOl UEV domain, and (iv) a fusion protein 
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containing said TsglOl protein, said TsglOl protein homologue or said TsglOl protein 
fragment; and (b) designing or selecting compounds capable of interacting with said 
protein based on said atomic coordinates. In a preferred embodiment, the method further 
includes a step of testing a selected compound capable of interacting with said protein for 
its ability to interfere with a protein-protein interaction between TsglOl and the retroviral 
GAG, particularly HIV GAGp6, and optionally a step of testing a selected test compound 
capable of interacting with said protein for its ability to inhibit viral budding, particularly 
HIV viral budding from an HIV-infected host cell. 

The present invention also provides methods for modulating, in mammalian cells, 
a protein complex comprising TsglOl and a retroviral GAG containing a P(T/S)AP lale 
domain motif. In a specific embodiment, the retroviral GAG is HIV GAG. The methods 
can be used to inhibit viral budding, particularly HIV viral budding from infected host 
cells. Inhibition of viral budding prevents the viruses from being released from the 
infected host cells thereby suppressing further viral propagation. Accordingly, the 
present invention also encompasses methods of treating viral infection, particularly HIV 
infection, and methods of treating and preventing AIDS in patients. 

In one embodiment, the concentration of a protein complex having TsglOl 
interacting with the retroviral GAG is reduced in cells. Various methods can be 
employed to reduce the concentration of the protein complex. The protein complex 
concentration can be reduced by interfering with the interaction between TsglOl and the 
retroviral GAG (e.g., HTV GAG). For example, compounds capable of interfering with 
interactions between TsglOl and HIV GAG can be administered to cells in vitro or in 
vivo in a patient Such compounds can be small organic molecules capable of binding 
TsglOl protein, particularly the UEV domain of TsglOl protein, or HIV GAGp6. They 
can also be antibodies immunoreactive with the TsglOl protein or HTV GAGp6. 
Preferably, antibodies that bind to the UEV domain of the TsglOl protein are used. Also, 
the compounds can be small peptides derived from the HTV GAGp6 protein or mimetics 
thereof capable of binding TsglOl, or small peptides derived from the TsglOl protein or 
mimetics thereof capable of binding HTV GAGp6. 

In another embodiment, the methods include inhibiting the expression of the 
TsglOl protein and/or HTV GAG protein. The inhibition can be at the transcriptional, 
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translational, or post-translati cmal level* For example, antisense compounds and 
ribozyme compounds can be administered to human cells in vitro or in human bodies. 

In yet another embodiment, an antibody selectively imraunoreactive with a 
protein complex having TsglOl interacting with the retroviral GAG (e.g, HIV GAG) is 
administered to cells in vitro or in human bodies to inhibit the protein complex activities 
and/or reduce the concentration of the protein complex in the cells or patient 

The present invention provides a method for treating HIV infection and AIDS that 
is distinct from the therapeutic approaches heretofore known die in the art The method 
is targeted at a cellular protein of the host cells as well as its interaction with a viral 
protein. The interaction is required for HDTV budding from the infected host cells. 
Accordingly, it is less likely that HIV will develop viral resistance to the treatment 
according to the present invention. 

The foregoing and other advantages and features of the invention, and the manner 
in which the same are accomplished, will become more readily apparent upon 
consideration of the following detailed description of the invention taken in conjunction 
with the accompanying examples, which illustrate preferred and exemplary 
embodiments. 

Brief Description of Th e Drawings 

Figure 1 is a diagram summarizing the proposed pathways for die budding by 
viruses using different late domain motifs; 

Figure 2 is a competitive inhibition curve showing that the p6(l-14) peptide 
having die first 14 amino acid residues of HIV GAGp6 is capable of inhibiting protein- 
protein interaction between GST-p6 and myc-Tsgl01(l-207); 

Figure 3 is a Dixon plot showing p6(l-14) inhibition of the interaction between 
GST-p6 andmyc-Tsgl01(l-207); 

Figure 4 is another Dixon plot showing p6(l-14) inhibition of the interaction 
between GST-p6 and myc-TsglOl (1-207). 
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Detailed Description of the Invention 
1. Definitions 

The terms '•polypeptide/' '^protein," and "peptide" are used herein 
interchangeably to refer to amino acid chains in which the amino acid residues are linked 
5 by peptide bonds or modified peptide bonds. The amino acid chains can be of any length 
of greater than two amino acids. Unless otherwise specified, the terms "polypeptide," 
"protein," and "peptide" also encompass various modified forms thereof. Such modified 
forms may be naturally occurring modified forms or chemically modified forms. 
Examples of modified forms include, but are not limited to, glycosylated forms, 

10 phosphorylaled forms, myristoylated forms, palmitoylated forms, ribosylated forms, 
acetylated forms, ubiquitinated forms, etc. Modifications also include intra-molecnlar 
crosslinking and covalent attachment to various moieties such as lipids, flavin, biotin, 
polyethylene glycol or derivatives thereof, etc. In addition, modifications may also 
include cyclization, branching and cross-linking. Further, amino acids other than the 

IS conventional twenty amino acids encoded by genes may also be included in a 
polypeptide. 

The term "protein fragment" as used herein means a polypeptide that represents a 
portion of a protein. When a protein fragment exhibits interactions with another protein 
or protein fragment, the two entities are said to interact through interaction domains that 

20 aze contained within the entities. 

As used herein, the term "interacting" or "interaction" means that two protein 
domains, fragments or complete proteins exhibit sufficient physical affinity to each other 
so as to bring the two "interacting" protein domains or proteins physically close to each 
other. An extreme case of interaction is the formation of a chemical bond that results in 

25 continual and stable proximity of the two domains. Interactions that are based solely on 
physical affinities, although usually more dynamic than chemically bonded interactions, 
can be equally effective in co-localizing two proteins. Examples of physical affinities 
and chemical bonds include but are not limited to, forces caused by electrical charge 
differences, hydrophobicity, hydrogen bonds, van der Waals force, ionic force, covalent 

30 linkages, and combinations thereof. The state of proximity between the interacting 
domains or entities may be transient or permanent, reversible or irreversible. In any 

15 
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event, it is in contrast to and distinguishable from contact caused by natural random 
movement of two entities. Typically although not necessarily, an "interaction" is 
exhibited by the binding between the interacting domains or entities. Examples of 
interactions include specific interactions between antigen and antibody, ligand and 
5 receptor, enzyme and substrate, and the like. 

An 'Interaction" between two protein domains, fragments or complete proteins 
can be determined by a number of methods. For example, an interaction can be 
determined by functional assays such as the two-hybrid systems. Protein-protein 
interactions can also be determined by various biochemical approaches based on the 

10 affinity binding between the two interacting partners. Such biochemical methods 
generally known in the art include, but are not limited to, protein affinity 
chromatography, affinity blotting, immunoprecipitation, and the like. The binding 
constant for two interacting proteins, which reflects the strength or quality of the 
interaction, can also be determined using methods known in the art. See Phizicky and 

15 Relds, Microbiol Rev., 59:94-123 (1995). 

As used herein, the term "domain" means a functional portion, segment or region 
of a protein, or polypeptide. "Interaction domain*' refers specifically to a portion, 
segment or region of a protein, polypeptide or protein fragment that is responsible for the 
physical affinity of that protein, protein fragment or isolated domain for another protein, 

20 protein fragment or isolated domain. 

As used herein, the term "protein complex" means a composite unit that is a 
combination of two or more proteins formed by interaction between the proteins. 
Typically but not necessarily, a "protein complex" is formed by the binding of two or 
more proteins together through specific non-covalent binding affinities. However, 

25 covalent bonds may also be present between the interacting partners. For instance, the 
two interacting partners can be covalently crosslinked so that the protein complex 
becomes more stable. 

Hie term 'Isolated protein complex" means a protein complex present in a 
composition or environment that is different from that found in nature in its native or 

30 original cellular or body environment Preferably, an "isolated protein complex" is 

separated from at least 50%, more preferably at least 75%, most preferably at least 90% 
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of other naturally co-existing cellular or tissue components. Thus, an "isolated protein 
complex" may also be a naturally existing protein complex in an artificial preparation or 
a non-native host cell. An 'Isolated protein complex" may also be a "purified protein 
complex", that is, a substantially purified form in a substantially homogenous preparation 
5 substantially free of other cellular components, other polypeptides, viral materials, or 
culture medium, or when the protein components in the protein complex are chemically 
synthesized, chemical precursors or by-products associated with the chemical synthesis. 
A "purified protein complex" typically means a preparation containing preferably at least 
75%, more preferably at least 85%, and most preferably at least 95% a particular protein 
10 complex. A "purified protein complex" may be obtained from natural or recombinant 
host cells or other body samples by standard purification techniques, or by chemical 
synthesis. 

The terms "hybrid protein," "hybrid polypeptide," "hybrid peptide," "fusion 
protean," "fusion polypeptide," and "fusion peptide" are used herein interchangeably to 

15 mean a non-naturally occurring protein having a specified polypeptide molecule 

covalendy linked to one or more polypeptide molecules which do not naturally link to the 
specified polypeptide. Thus, a "hybrid protein" may be two naturally occurring proteins 
or fragments thereof linked together by a covalent linkage. A "hybrid protein" may also 
be a protein formed by covalently linking two artificial polypeptides together. Typically 

20 but not necessarily, the two or more polypeptide molecules are linked or "fused" together 
by a peptide bond forming a single non-branched polypeptide chain. 

As used herein, the term "homologue," when used in connection with a first 
native protein or fragment thereof that is discovered, according to the present invention, 
to interact with a second native protein or fragment thereof, means a polypeptide that 

25 exhibits an amino acid sequence homology and/or structural resemblance to the first 

native interacting protein, or to one of the interacting domains of die first native protein 
such that it is capable of interacting with the second native protein. Typically, a protein 
homologue of a native protein may have an amino acid sequence that is at least 50%, 
preferably at least 75%, more preferably at least 80%, 85%, 86%, 87%, 88% or 89%, 

30 even more preferably at least 90%, 91%, 92%, 93% or 94%, and most preferably 95%, 
96%, 97%, 98% or 99% identical to the native protein. Examples of homologies may be 

17 



WO 02/072790 



PCT/US02/08146 



the ortholog proteins of other species including animals, plants* yeast, bacteria, and the 
like. Homologues may also be selected by, e.g., mutagenesis in a native protein. For 
example, homologues may be identified by site-specific mutagenesis in combination with 
assays for detecting protein-protein interactions, e.g., the yeast two-hybrid system 

5 described below, as will be apparent to skilled artisans apprised of the present invention. 

For purposes of comparing two different nucleic acid or polypeptide sequences, 
one sequence (test sequence) may be described to be a specific "percent identical to" 
another sequence (reference sequence) in the present disclosure. In this respect, when the 
length of the test sequence is less than 90% of the length of the reference sequence, the 

10 percentage identity is determined by the algorithm of Myers and Miller, Bull Math. Biol, 
51:5-37 (1989) and Myers and Miller, Comput. Appl. Biosci., 4(l):ll-7 (1988). 
Specifically, the identity is determined by the ALIGN program, which is available at 
http://www2igh.cnrs.fr maintained by IGH, Montpellier, FRANCE. Typically the 
default parameters should be used. A modified form of the ALIGN program may also be 

15 used. 

Where the length of the test sequence is at least 90% of the length of the reference 
sequence, the percentage identity is determined by the algorithm of Katiin and Altschul, 
Proc. Natl Acad. ScL USA, 90:5873-77 (1993), which is incorporated into the various 
BLAST programs. Specifically, the percentage identity is determined by the "BLAST 2 

20 Sequences" tool, which is available at http://www.nc^.nlm.nih.gov/gori^bl2Jitml. See 
Tatusova and Madden, FEMS Microbiol Lett., 174(2):247-50 (1999), For pairwise 
DNA-DNA comparison, the BLASTN 2.1.2 program is used with default parameters 
(Match: 1; Mismatch: -2; Open gap: 5 penalties; extension gap: 2 penalties; gap 
x_dropoff: 50; expect: 10; and word size: 11, with filter). For pairwise protein-protein 

25 sequence comparison, the BLASTP 2. 1 .2 program is employed using default parameters 
(Matrix: BLOSUM62; gap open: 11; gap extension: 1; xjdropoff: 15; expect 10.0; and 
wordsize: 3, with filter). 

The term "derivative," when used in connection with a first native protein (or 
fragment thereof) that is discovered, according to the present invention, to interact with a 

30 second native protein (or fragment thereof), means a modified form of the first native 
protein prepared by modifying the side chain groups of the first native protein without 
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changing the amino acid sequence of the first native protein. The modified form, i.e., the 
derivative should he capable of interacting with the second native protein. Examples of 
modified forms include glycosylated forms, phosphorylated forms, myristylated forms, 
ribosylated forms, ubiquitinated forms, and the like. Derivatives also include hybrid or 
fusion proteins containing a native protein or a fragment thereof. Derivatives can be 
prepared using any known techniques and tested for their interaction with the second 
native protein. 

The team "antibody" as used herein encompasses both monoclonal and polyclonal 
antibodies that fall within any antibody classes, e.g., IgO, IgM, IgA, or derivatives 
thereof. The term "antibody" also includes antibody fragments including, but not limited 
to, Fab, F(ab # )2, and conjugates of such fragments, and single-chain antibodies 
comprising an antigen recognition epitope. In addition, the team "antibody" also means 
humanized antibodies, including partially or fully humanized antibodies. An antibody 
may be obtained from an animal, or from a hybridoma cell line producing a monoclonal 
antibody, or obtained from cells or libraries recombinantly expressing a gene encoding a 
particular antibody. 

The term "selectively immunoreactive" as used herein means that an antibody is 
reactive thus binds to a specific protein or protein complex, but not other similar proteins 
or fragments or components thereof. 

The term "compound" as used herein encompasses all types of organic or 
inorganic molecules, including but not limited proteins, peptides, polysaccharides, lipids, 
nucleic acids, small organic molecules, inorganic compounds, and derivatives thereof. 

Unless otherwise specified, the term "TsglOl" as used herein means human 
TsglOl protein. Unless otherwise specified, the term "HIV GAG" as used herein means 
HIV GAG protein. Unless otherwise specified, the term "HIV GAGp6" as used herein 
means HJV GAGp6 protein. 

As used herein, the term "HIV infection" generally encompasses infection of a 
host animal, particularly a human host, by the human immunodeficiency virus (HIV) 
family of retroviruses including, but not limited to, HIV I, HTV II, HIV m, and the like. 
Thus, treating HTV infection will encompass the treatment of a person who is a carrier of 
any of the HTV family of retroviruses or a person who is diagnosed of active AIDS, as 
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well as the treatment or prophylaxis of the ADDS-related conditions in such persons. A 
carrier of HTV may be identified by any methods known in the ait For example, a person 
can be identified as a HIV carrier on the basis that the person is anti-HIV antibody 
positive, or is HIV-positive, or has symptoms of AIDS. That is, "treating HIV infection" 
5 should be understood as treating a patient who is at any one of the several stages of HTV 
infection progression, which, for example, inclnde acute primary infection syndrome 
(which can be asymptomatic or associated with an influenza-like illness with fevers, 
malaise, diarrhea and neurologic symptoms such as headache), asymptomatic infection 
(which is the long latent period with a gradual decline in the number of circulating CD 4 * 

10 T cells), and AIDS (which is defined by more serious AIDS-defining illnesses and/or a 
decline in the circulating CD4 cell count to below a level that is compatible with effective 
immune function). In addition, "treating or preventing HIV infection 11 will also 
encompass treating suspected infection by HIV after suspected past exposure to HTV by 
e.g., blood transfusion, exchange of body fluids, bites, accidental needle stick, or 

15 exposure to patient blood during surgery. 

The term "treating AIDS" means treating a patient who exhibits more serious 
AIDS-defining illnesses and/or a decline in the circulating CD4 cell count to below a 
level that is compatible with effective immune function. The term "treating ADDS" also 
encompasses treating AIDS-related conditions, which means disorders and diseases 

20 incidental to or associated with AIDS or HIV infection such as AIDS-related complex 
(ARC), progressive generalized lymphadenopathy (PGL), anti-EQV antibody positive 
conditions, and HIV-positive conditions, AIDS-related neurological conditions (such as 
dementia or tropical paraparesis), Kaposi's sarcoma, thrombocytopenia purpurea and 
associated opportunistic infections such as Pneumocystis carinii pneumonia, 

25 Mycobacterial tuberculosis* esophageal candidiasis, toxoplasmosis of the brain, CMV 
retinitis, HIV-related encephalopathy, HIV-related wasting syndrome, etc. 

Thus, the term "preventing AIDS" as used herein means preventing in a patient 
who has HIV infection or is suspected to have HTV infection oar is at risk of HIV infection 
from developing AIDS (which is characterized by more serious AIDS-defining illnesses 

30 and/or a decline in the circulating CD4 cell count to below a level that is compatible with 
effective immune function) and/or AIDS-related conditions. 
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2. Protein Complexes 
Novel protein-protein interactions have been discovered and confirmed using 
yeast two-hybrid systems. In particular, it has been discovered that TsglOl interacts with 
HIV GAGp6. Binding regions of TsglOl and HIV GAGp6 discovered in yeast two- 
hybrid systems are summarized in Table 1 . Hie GenBank accession numbers for the 
gene sequences and amino acid sequences of TsglOl and HIV GAGp6 axe noted in Table 
1 below. 



Table 1: Binding Regions of HIV Gag and TsplOl 



Baft 


Prey 


Bait Protein 


AA 

Coordinates 


Prey Protein 


AA 

Coordinates 




Start 


End 




Start 


End 


HIV Gag 
(GenBank 
Accession No. 
AF324493) 


449 


500 


Tumor Suppressor Gene 101 
(TsglOl) (GenBank 
Accession No. U82130) 


7 


390 



2J. TsglOl Is Involved in BffV Viral Budding 

Tumor susceptibility gene 101 (TsglOl) was originally identified as a 381 amino 
acid polypeptide involved in tumorigenesis. TsglOl can be localized in the nucleus and 
in the cytoplasm depending on the stage of cell cycle. TsglOl interacts with stathmin, a 
cytosolic phosphoprotein implicated in tumorigenesis, and overexpression of a TsglOl 
anti-sense transcript in NIH-3T3 cells results in transformation of the cells. See Li and 
Cohen, Cell, 85(3):3 19-29 (1996). Furthermore, it has been suggested that defects in 
TsglOl may occur during breast cancer tumori genesis and/or progression, Li et aL, Cell, 
88(1): 143-54 (1997). TsglOl contains a uMquitin-conjugadng enzyme E2 catalytic 
domain. Recently, interest has focused on TsglOl as a possible component of the 
ubiquitin/proteasome degradation pathway. By database search and comparison, it has 
been found that that N-terminal TsglOl contains a domain related to E2 ubiquitm- 
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conjugating (Ubc) enzymes although lacking the active site cysteine. See Koonin and 
Abagyan, Nat. Genet., 16(4):330-1 (1997). Thus, TsglOl may belong to a group of 
apparently inactive homologs of Ubc enzymes. See id. The domain related to E2 
ubiquitin-conjugating (Ubc) enzymes is referred to nbiquitin E2 variant (UEV) domain. 
5 In accordance with the present invention, a search of a human spleen library with 

GAG polyprotein (aa 449-500, p6 domain, or "GAGP6") of HTV-1 isolated the tumor 
susceptibility TSG 101 protein (TsglOl; aa 7-390) as an interactor. The GAGp6 bait 
used here contains a late domain motif (-PTAP-). The GAG polyprotein of retroviruses 
gives rise to a set of mature proteins (matrix, capsid, and nucleocapsid) that produce the 

10 inner virion core. In addition, GAG also contains a C-terminal portion called p6. In the 
case of HIV1 , GAGp6 contains a sequence called the late domain, so-called because it is 
required for a late stage of HTV viral budding from the host cell surface. The late domain 
has a functional relationship with ubiquitin, in that the late domain is required in viral 
budding, and depletion of the intracellular pool of free ubiquitin produces a similar late 

15 phenotype. Patnaik et al., Proc. Natl Acad. Set USA, 97(24): 13069-74 (2000); Schubert 
et a/., Proc. Natl Acad. Set. USA, 97(24): 13057-62 (2000); Strack et aL, Proc. Nad. 
Acad. Sci. USA, 97(24): 13063-8 (2000). The late domain is thought to represent a 
docking site for the ubiquitination machinery. 

As is known in the art, the P(T/S) AP motif is conserved among the GAGp6 

20 domains of all known primate Antiviruses. In nonprimate lentiviruses, which lack a 
GAGp6 domain, the P(T/S)AP motif is at the immediate C terminus of the GAG 
polyprotein. It has been shown that the P(T/S)AP motif is required for a late stage of 
viral budding from the host cell surface. It is critical for lenti virus' and particularly HTV's 
particle production. See Huang et al„ J. Virol, 69:6810-6818 (1995). Specifically, 

25 deletion of the PTAP motif results in drastic reduction of viral particle production. In 
addition, the PTAP-deficient viruses proceeded through the typical stages of 
morphogenesis but failed to complete Jhe process. Rather, they remain tethered to the 
plasma membrane and are thus rendered non-infectious. That is, the viral budding 
process is stalled. See Huang etal, J. Virol., 69:6810-6818 (1995). 

30 In accordance with the present invention, different GAGp6 point mutants (E6G, 

P7L, A9R, or P10L) were generated and tested for their ability to bind TsglOl protein. 
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See Example 2 below. While the wild-type GAGp6 peptide and the E6Q OAGp6 mutant 
were capable of binding TsglOl protein, each of the P7L, A9R, and P10L point 
mutations abolishes the GAGp6 binding affinity to TsglOl . The P7L, A9R, and P10L 
point mutations alter the PTAP motif in GAGp6 peptide. The same mutations in die 
PTAP motif of the HIV GAGp6 gag protein prevent HTV particles from budding from die 
host cells. See Huang et aL, J. ViroU 69:6810-6818 (1995). Further, as shown in 
Example 3 below, the inventors of the present inveniton discovered that the first 14 
amino acid residues of HTV GAGp6 (which includes the PTAP late domain motif) are 
sufficient in binding to the N-tenninal portion of TsglOl (amino acid residues 1-207, 
which includes the TsglOl UEV domain). 

TsglOl is intimately involved in endocytosis, intracellular vesicle trafficking, and 
vacuolar protein sorting (VPS). The VPS pathway sorts membrane-bound protons far 
eventual degradation m the ly so some (vacuole in yeast). See Lemmon and Traub, Curr. 
Opiru CeU. BioL, 12:457-66 (2000). Two alternative entrees into the VPS pathway are 
via vesicular trafficking from the Golgi (e.g. f in degrading misfolded membrane proteins) 
or via endocytosis from die plasma membrane (e.g., in downregulating surface proteins 
like epidermal growth factor receptor (EGFR)). Vesicles carrying proteins from either 
source can enter die VPS pathway by fusing with endosomes. As these endosomes 
mature, their cargos are sorted for lysosomal degradation via the formation of structures 
called multivesicular bodies (MVB). MVB are created when surface patches on late 
endosomes bud into the compartment, forming small (-50-100 nm) vesicles. A maturing 
MVB can contain tens or even hundreds of these vesicles. The MVB then fuses with the 
lysosome, releasing the vesicles for degradation in this hydrolytic organelle. 

The TsglOl prey fragment isolated in yeast two-hybrid assay contains the 
ubiquitin E2 variant (UEV) domain indicating that the UEV domain is involved in die 
binding to the P(T/S)AP domain. The involvement of the TsglOl UEV domain is 
consistent with the fact that ubiquitin is required for retrovirus budding and that 
proteasome inhibition reduces the level of free ubiquitin in HTV- 1 -infected cells and 
interferes with the release and maturation of HIV-1 and HTV-2. See Patnaik et al 9 Proc 
Natl. Acad. Sci. USA, 97(24): 13069-74 (2000); Schubert et al. 9 Proc. Natl Acad. ScL 
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USA, 97(24): 13057-62 (2000); Strack et al, Proc. Natl Acad. Set USA, 97(24): 13063-8 
(2000). 

It is known that short chains of Ub (1-3 molecules) can "mark" surface receptors 
for endocytosis and degradation in the lysosome. Hicke, Trends Cell Biol, 9:107-112 
5 (1999); Rotin et al, J. Membr. Biol, 176:1-17 (2000). Several classes of proteins that 
carry the P(T/S) AP motif are surface receptors known to be degraded via the VPS 
pathway or function in the VPS pathway. See Farr et aL, Biochem. J., 345(3):503-509 
(2000); Staub and Rotnt, Structure, 4:495-499 (1996). Although it is not known whether 
TsglOl lacks ubiquitin ligase activity, it is believed, based on the large number of TsglOl 

10 mteractors discovered in accordance with the present invention, that a plausible role for 
TsglOl in the VPS pathway is to recognize ubiquitinated proteins thai cany P(T/S)AP 
motifs and help coordinate their incorporation into vesicles that bud into the MVB. 

This is especially intriguing because the formation of MVB is the only known 
cellular process in which cell buds a vesicle out of the cytoplasm into another 

15 compartment This budding is topologically equivalent to viral budding in which viruses 
bud out of the cytoplasm at the plasma membrane into excellular space. Accordingly, 
while not wishing to be bound by any theory, it is believed that the binding of the 
P(T/S)AP motif in lentivirus GAG polyproteins to the cellular protein TsglOl enables 
lentiviruses having the P(T/S)AP motif to usurp cellular machinery normally used far 

20 MVB formation to allow viral budding from the plasma membrane. It is also believed 
that depletion of TsglOl or interfering with the interaction between TsglOl and the 
P(T/S)AP motif in lentivims-infected cells will prevent lentivirual budding from the cells. 

In addition, the recruitment of cellular machinery to facilitate virus budding 
appears to be a general phenomenon, and distinct late domains have been identified in the 

25 structural proteins of several other enveloped viruses. See Vogt, Proc. Natl. Acad. Set 
. USA, 97:12945-12947 (2000). Two well characterized late domains are the "PY" motif 
(consensus sequence: PPXY; X- any amino acid) found in membrane-associated proteins 
from certain enveloped viruses. See Craven et al. 9 J. Virol, 73:3359-3365 (1999); Harty 
et al. % Proc. Natl Acad. Set USA, 97:13871-13876 (2000); Harty et al., J. Virol, 

30 73:2921-2929 (1999); and Jayakar et at., J. Virol, 74:9818-9827 (2000). The cellular 
target for the FY motif is Nedd4 winch also contains a Hect ubiquitin E3 ligase domain. 
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The "YL" motif (YXXL) was found in the GAO protein of equine infectious anemia 
virus (EIAV). Puffer et al., J. Virol, 71:6541-6546 (1997); Puffer et al., J. Virol, 
72:10218-10221 (1998). The cellular receptor for the "YL** motif appears to be the AP- 
SOsubunitof AP-2. Puffers a/.,/. ViroL, 72:10218-10221 (1998). Interestingly, the 
late domains such as the P(T/S)AP motif, PY motif and the YL motif can still function 
when moved to different positions within retroviral OAO proteins, which suggests that 
they are docking sites for cellular factors rather than structural elements. Parent et al., J. 
Virol, 69:5455-5460 (1995); Yuan et al, EMBO 18:4700-4710 (2000). Moreover, the 
late domains such as the P(T7S)AP motif; PY motif and the YL motif can function 
interchangeably. That is one late domain motif can be used m place of another late 
domain motif without affecting viral budding. Parent et al., J. Virol, 69:5455-5460 
(1995); Yuan et al, EMBO J., 18:4700-4710 (2000); Strack et al., Proc. Natl. Acad. Set 
USA 97:13063-13068 (2000). 

Accordingly, while not wishing to be bound by any theory, it is believed that as 
shown in Figure 1, although the three late domain motifs bind to different cellular targets, 
they utilize common cellular pathways to effect viral budding. In particular, it is believed 
that the different cellular receptors for viral late domain motifs feed into common 
downstream steps of the vacuolar protein sorting (VPS) and MVB pathway. As 
discussed above, TsglOl functions in the VPS pathway. Another protein, Vps4 functions 
in TsglOl cycling and endosomal trafficking. Particularly, Vps4 mutants prevent normal 
TsglOl trafficking and induce formation of aberrant, highly vacuolated endosomes that 
are defective in the sorting and recycling of endocytosed substrates. See Babst et al, 
Trqffic, 1:248-258 (2000). 

Interestingly, a search of a spleen library with the tumor susceptibility protein 
TsglOl also identified an interaction with the growth arrest-specific protein GASTb. In 
addition, as disclosed in the commonly assigned U.S. Provisional Application Serial No. 
60/31 1,528, GAS7b is an interactor of the capsid region of the HIV GAG polyprotein. 
GAS7b is expressed preferentially in cells that are entering the quiescent state. Inhibition 
of GAS7b expression in terminally differentiating cultures of embryonic murine 
cerebellum impedes neurite outgrowth, while overexpression in undifferentiated 
neuroblastoma cell cultures dramatically promotes neurite-like outgrowth. Ju et al, Proc 
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NatlAcadSci 95(19): 11423-8 (1998); Lazakovitch et aL, Genomics 61(3):298-306 
(1999). These findings suggest a role for GAS7b in controlling terminal cellular 
differentiation, and the domain structure of GAS7b suggests it may do this by regulating 
the cytoskeleton. In addition, GAS7b also interacts with two different regulators of small 
5 GTPases that control the actin cytoskeleton. The interactions of GAS7b with the HIV 
capsid and with TsglOl (which in tain interacts with the HIV GAGp6 protein) strongly 
suggest these proteins form a multimolecular complex involved in the late stages of viral 
assembly and budding. 

10 2*2. Protein Complexes 

As discussed above, die UEV domain of the TsglOl protein and the PTAP motif 
of the HIV GAGp6 are responsible for the interactions. In addition, an examination of 
HTV-1 amino acid sequence variants in GenBank by the inventors using BLAST (Basic 
Local Alignment Search Tool) identified a number of HIV strains with the standard 

IS PTAP motif being replaced with variations of the motif; indicating that such variations 
can also enable viral budding and that peptides with such variations may also bind 
TsglOl . Such identified variations include the PSAP motif, the PIAP motif (see Zhang et 
al, /. Virol, 71:6662-6670 (1997); Farrar et al„ J. Med. Virol, 34:104-113 (1991)), and 
the PTTP motif (see Zhang et al, J. Virol, 71:6662-6670 (1997). 

20 Accordingly, the present invention provides protein complexes formed by 

interactions between TsglOl and HIV GAG polypeptide. The present invention also 
provides a protein complex having a homologue, derivative or fragment of TsglOl 
interacting with HIV GAG polypeptide. In addition, the present invention further 
encompasses a protein complex having TsglOl interacting with a homologue, derivative 

25 or fragment of HIV GAG polypeptide. In yet another embodiment, a protein complex is 
provided having a homologue, derivative or fragment of TsglOl and a homologue, 
derivative or fragment of HIV GAG polypeptide. In other words, one or more of the 
interacting protein members of a protein complex of the present invention may be a 
native protein or a homologue, derivative or fragment of a native protein. As defined in 

30 Section 1 above, the homologues and derivatives of TsglOl should be capable of 

interacting with HIV GAG (through the GAGp6 region). The HIV GAG homologues 
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and derivatives are those capable of interacting with TsglOl, Preferably, the amino acid 
sequence of a homologue is at least 85%, 90% t or 95% identical to that of its 
corresponding native protein. 

Thus, for example, one interacting partner in the protein complexes can be a 
5 complete native TsglOl, a TsglOl homologue capable of interacting with the HIV 

GAGp6, aTsglOl derivative, a derivative of the TsglOl homologue, a TsglOl fragment 
capable of interacting with HIV GAGp6 (e.g., a fragment containing the UEV domain of 
the TsglOl protein, specifically the amino acid residues 1-207] the amino acid residues 1- 
147, etc.), a derivative of the TsglOl fragment, or a fusion protein containing (1) 

10 complete native TsglOl, (2) a TsglOl homologue capable of interacting with the HIV 
GAGp6 or (3) a TsglOl fragment capable of interacting with HIV GAGp6. 

The other partner can be (1) an HIV GAG polypeptide, (2) an HIV GAG 
polypeptide homologue having an ammo acid sequence at least 85%, 90% or 95% 
identical to that of HW GAG polypeptide and capable of interacting with TsglOl, (3) 

15 HIV GAGp6 protein, (4) an HIV GAGp6 homologue having an amino acid sequence at 
least 80%, 85%, 90% or 95% identical to that of HIV GAGp6 and capable of interacting 
with TsglOl, (5) an HTV GAGp6 fragment capable of interacting with TsglOl, or (6) a 
fusion protein containing an HIV GAG polypeptide, an HIV GAG polypeptide 
homologue, HIV GAGp6, an HIV GAGp6 homologue or an HTV GAGp6 fragment 

20 The protein complexes of the present invention contains a HTV GAG polypeptide 

as an interacting partner. In addition, GAG polypeptides and fragments thereof from 
other retroviruses containing the P(T/S/I)(A/T)P (SEQ ID NOs:l-€) late domain motif are 
believed to also interact with TsglOl in the same manner as the HIV GAG polypeptide. 
Thus, they can be used in forming protein complexes with TsglOl or a homologue or 

25 derivative or fragment thereof. Preferably, GAG polypeptides or fragments thereof of 
lentiviruses containing the P(T/S)AP late domain are used to form protein complexes. 
Such GAG polypeptides or fragments thereof may be from a non-primate lentiviruses 
including bovine lentiviruses (e.g. bovine immunodeficiency virus (BIV), Jembrana 
disease virus), feline lentiviruses (e.g. feline immunodeficiency virus (HV) which causes 

30 immunodeficiency, wasting, and encephalitis in cats), and ovine/caprine lentivirus (e.g. 
caprine arthritis-encephalitis virus (CAEV) which causes anemia and wasting in goats, 
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ovine lenti virus, Visna virus which causes pneumonia, wasting, encephalitis and 
arthritis). Preferably, the GAG polypeptides or fragments thereof are from primate 
lenti viruses including, but not limited to, human immunodeficiency virus type 1 (HTV-1), 
human immunodeficiency virus type 2 (HTV-2), human immunodeficiency virus type 3 
5 (KLV-3) (all of which cause AIDS), and various simian immunodeficiency viruses that 
infect hosts such as chimpanzee, mangabey, African Green monkey, mandrill, UHoest, 
Sykes' monkey, or Guereza Colobus monkey. 

Besides the native retroviral GAG polypeptides, useful interacting partners for 
TsglOl or a homologue or derivative or fragment thereof also include homologues of the 

10 native retroviral GAG polypeptides capable of interacting with TsglOl, derivatives of the 
native car homologue GAG polypeptides capable of interacting with TsglOl, fragments of 
the GAG polypeptides capable of interacting with TsglOl (e.g., a fragment containing the 
P(T/S)AP motif), derivatives of the GAG polypeptide fragments, or fusion proteins 
containing (1) a complete GAG polypeptide, (2) a GAG polypeptide homologue capable 

15 of interacting with TsglOl or (3) a GAG polypeptide fragment capable of interacting 
with TsglOl. 

TsglOl fragments capable of interacting with HIV GAGp6 can be the 
combination of molecular engineering of a TsglOl -encoding nucleic acid and a method 
for testing protein-protein interaction. For example, the coordinates in Table 1 can be 

20 used as starting points and various TsglOl fragments falling within the coordinates can 
be generated by deletions from either or both ends of the coordinates. The resulting 
fragments can be tested for their ability to interact with HIV GAGp6 using any methods 
known in the art for detecting protein-protein interactions (e.g., yeast two-hybrid 
method). Likewise, HTV GAG fragments, HIV GAGp6 fragments and other retroviral 

25 GAG polypeptide fragments capable of interacting with TsglOl can also be identified in 
a similar manner. 

In specific embodiments, the protein complex of the present invention contains a 
polypeptide that contains a contiguous span of at least 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 
25, 30, 35, 40, 50 or more amino acid residues of a naturally occurring HTV GAG 
30 sequence. Preferably, the polypeptide contains a contiguous span of between 10 and 20 
amino acid residues of a naturally occurring HIV GAG sequence. The contiguous span 
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should span the HIV late domain motif which can be the P(T/S)AP motif or a variation 
thereof (e.g., the PIAP motif and the PTTP motif). Preferably, the late domain motif in 
the contiguous span is the P(T/S)AP motif. In other specific embodiments, the protein 
complex contains a polypeptide that contains a contiguous span of at least 8, 9, 10, 1 1, 
12, 13, 14, IS, 20, 25, 30, 35, 40, 50 or more amino acid residues of a naturally occurring 
GAG polypeptide sequence from other retroviruses containing a P(T/S)AP late domain 
motif. The contiguous span should span the retrovirus late domain motif. In preferred 
embodiments, such other retroviruses are primate lenti viruses or non-primate lend viruses 
(except for EIAV). In specific embodiments, the protein complex of the present 
invention includes a polypeptide having an amino acid sequence selected from the group 
of EPTAP (SEQ ID NO:7), EPS AP (SEQ ID NO:8), PTAPP (SEQ ID NO:9), PSAPP 
(SEQ ID NO:10) f EPTAPP (SEQ ID NO:ll), EPSAPP (SEQ ID NO:12), PEPTAP(SEQ 
ID NO:13), PEPSAP (SEQ ID NO:14), RPEPTAP (SEQ ID NO:15), RPEPSAP (SEQ ID 
NO:16), PEPTAPP (SEQ ID NO: 17), PEPSAPP (SEQ ID NO: 18), EPTAPPEE (SEQ ID 
NO:19), EPSAPPEE (SEQ ID NO:20), EPTAPPAE (SEQ ID NO:21), PEPTAPPEE 
(SEQ ID NO:22), PEPTAPP AE (SEQ ID NO:23), PEPSAPPEE (SEQ ID NO:24), 
RPEPTAPPEE (SEQ ID NO:25), RPEPSAPPEE (SEQ ID NO:26), RPEPTAPPAE (SEQ 
ID NO:27), RPEPSAPPAE (SEQ ID NO: 28), LQSRPEPTAPPEE (SEQ XD NO:29), 
LQSRPEPSAPPEE (SEQ ID NO:30), LQSRPEPTAPPEES (SEQ ID NO;31), and 
LQSRPEPSAPPEES (SEQ ID NO:32). 

Furthermore, it is believed that the P(T/S)AP or PIAP or PTTP motif itself may 
be sufficient for TsglOl binding. Accordingly, a protein complex is also provided 
containing TsglOl protein or a homologue or derivative or fragment thereof interacting 
with a polypeptide consisting essentially the P(T/S)AP or PIAP or PTTP motif, i.e., a 
polypeptide having the P(T/S)AP or PIAP or PTTP motif and a few flanking amino acids. 

In a specific embodiment of the protein complex of the present invention, two or 
more interacting partners (TsglOl and HIV GAGp6, or homologues, derivatives or 
fragments thereof) are directly fused together, or covalently linked together through a 
peptide linker, forming a hybrid protein having a single unhranched polypeptide chain. 
Thus, the protein complex may be formed by "intramolecular" interactions between two 
portions of the hybrid protein. Again, one or both of the fused or linked interacting 
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partners in this protein complex may be a native protein or a homologue, derivative or 
fragment of a native protein. 

The protein complexes of the present invention can also be in a modified form. 
For example, an antibody selectively immunoreactive with the protein complex can be 
5 bound to the protein complex. In another example, a non-antibody modulator capable of 
enhancing the interaction between the interacting partners in the protein complex may be 
included. Alternatively, the protein members in the protein complex may be cross-linked 
for purposes of stabilization. Various crosslink! rig methods may be used For example, a 
Afunctional reagent in the form of R-S-S-R' may be used in which the R and R* groups 

10 can react with certain amino acid side chains in the protein complex forming covalent 
linkages. See e.g., Trout et aL, in Creighton ed„ Protein Function: A Practical 
Approach DRX. Press, Oxford, 1989; Baud et at, /. Biol Chenu, 251:6953-6962 (1976), 
Other useful crosslinking agents include, e.g., Denny-Jaffee reagent, a heterbiofunctional 
photoactivable moiety cleavable through an azo linkage (See Denny et aL, Proa Natl 

15 Acad. Sri USA, 81 :5286-5290 (1984)), and S-[N-(3-iodo-4- 

azidosaHcyl)cysteaminyl]--2-thiopyridine}, a cysteine-specific photocrosslinking reagent 
(see Chen etaL, Science, 265:90-92 (1994)). 

The above-described protein complexes may further include any additional 
components e.g., other proteins, nucleic acids, lipid molecules, monosaccharides or 

20 polysaccharides, ions or other molecules. 

23. Methods of Preparing Protein Complexes 
The protein complex of the present invention can be prepared by a variety of 
methods. Specifically, a protein complex can be isolated directly from an animal tissue 

25 sample, preferably a human tissue sample containing the protein complex. Alternatively, 
a protein complex can be purified from host cells that recombinantly express the 
members of the protein complex. As will be apparent to a skilled artisan, a protein 
complex can be prepared from a tissue sample or recombinant host cell by 
coimmunoprecipitation using an antibody immunoreactive with an interacting protein 

30 partner, or preferably an antibody selectively immunoreactive with the protein complex 
as will be discussed in detail below. The antibodies can be monoclonal or polyclonal. 
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Coimmunoprecipitation is a commonly used method in the art for isolating or detecting 
bound proteins. In this procedure, generally a serum sample or tissue or cell lysate is 
admixed with a suitable antibody. The protein complex bound to the antibody is 
precipitated and washed. The bound protein complexes are then eluted. 
S Alternatively, immnnoaffinity chromatography and immunobloting techniques 

may also be used in isolating the protein complexes from native tissue samples or 
recombinant host cells using an antibody immunoreactive with an interacting protein 
partner, or preferably an antibody selectively immunoreactive with the protein complex. 
For example, in protein immunoaffinity chromatography, the antibody may be covalently 

10 or non-covalently coupled to a matrix such as Sepharose in, e.g., a column. The tissue 
sample or cell lysate from the recombinant cells can then be contacted with the antibody 
on the matrix. The column is then washed with a low-salt solution to wash off the 
unbound components. The protein complexes that are retained in the column can be then 
eluted from the column using a high-salt solution, a competitive antigen of the antibody, 

15 a chaotropic solvent, or sodium dodecyl sulfate (SDS), or the like. In immunoblotting, 
crude proteins samples from a tissue sample or recombinant host cell lysate can be 
fractionated on a polyacrylamide gel electrophoresis (PAGE) and then transferred to, e.g., 
a nitrocellulose membrane. The location of the protein complex on the membrane may 
be identified using a specific antibody, and the protein complex is subsequently isolated. 

20 In another embodiment, individual interacting protein partners may be isolated or 

purified independently from tissue samples or recombinant host cells using similar 
methods as described above. The individual interacting protein partners are then 
contacted with each other under conditions conducive to the interaction therebetween 
thus forming a protein complex of die present invention. It is noted that different protein- 

25 protein interactions may require different conditions. As a starling point, for example, a 
buffer having 20 mM Tris-HCl, pH 7.0 and 500 mM NaCl may be used. Several 
different parameters may be varied, including temperature, pH, salt concentration, 
reducing agent, and the like. Some minor degree of experimentation may be required to 
determine the optimum incubation condition, this being well within the capability of one 

30 skilled in the art once apprised of the present disclosure. 
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In yet another embodiment, the protein complex of the present invention may be 
prepared from tissue samples or recombinant host cells or other suitable sources by 
protein affinity chromatography or affinity blotting. That is, one of the interacting 
protein partners is used to isolate the other interacting protein partner(s) by binding 
5 affinity thus forming protein complexes. Thus, an interacting protein partner prepared by 
purification from tissue samples or by recombinant expression or chemical synthesis may 
be bound covalently or non-covalently to a matrix such as Sepharose in, e.g., a 
chromatography column. The tissue sample or cell lysate from the recombinant cells can 
then be contacted with the bound protein on the matrix. A low-salt solution is used to 

10 wash off the unbound components, and a high-salt solution is then employed to elute the 
bound protein complexes in the column. In affinity blotting, crude protein samples from 
a tissue sample or recombinant host cell lysate can be fractionated on a polyacryl amide 
gel electrophoresis (PAGE) and then transferred to, e.g., a nitrocellulose membrane. The 
purified interacting protein member is then bound to its interacting protein partners) on 

IS the membrane forming protein complexes, which are then isolated from the membrane. 

It will be apparent to skilled artisans that any recombinant expression methods 
may be used in the present invention for purposes of recombinantly expressing the 
protein complexes or individual interacting proteins. Generally, a nucleic acid encoding 
an interacting protein member can be introduced into a suitable host cell. For purposes of 

20 recombinantly forming a protein complex within a host cell, nucleic acids encoding two 
or more interacting protein members should be introduced into the host cell. 

Typically, the nucleic acids, preferably in the form of DNA, are incorporated into 
a vector to form expression vectors capable of expressing the interacting protein 
member(s) once introduced into a host cell. Many types of vectors can be used for the 

25 present invention. Methods for the construction of an expression vector for purposes of 
this invention should be apparent to skilled artisans apprised of the present disclosure. 
See generally, Current Protocols in Molecular Biology, Vol. 2, Ed. Ausubel, et al. % 
Greene Publish. Assoc. & Wiley Interscience, Ch. 13, 1988; Glover, DNA Cloning, Vol. 
n, ERL Press, Wash., D.C, Ch. 3, 1986; Bitter, et at, in Methods in Enzymology 

30 153:5 16-544 (1987); The Molecular Biology of the Yeast Saccharomyces, Eds. Strathern 
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et al„ Cold Spring Harbor Press, Vols. I and n, 1982; and Sambrook ex al>, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, 1989. 

Generally, the expression vectors include an expression cassette having a 
promoter operably linked to a DNA encoding an interacting protein member. Hie 
5 promoter can be a native promoter, i.e., the promoter found in naturally occurring cells to 
be responsible for the expression of the interacting protein member in the cells. 
Alternatively, the expression cassette can be a dumeric one, i.e., having a heterologous 
promoter that is not the native promoter responsible for the expression of the interacting 
protein member in naturally occurring cells. The expression vector may further include 

10 an origin of DNA replication for the replication of the vectors in host cells. Preferably, 
the expression vectors also include a replication origin for the amplification of the vectors 
in, e.g., E. coli, and selection marker(s) for selecting and maintaining only those host 
cells harboring the expression vectors. Additionally, the expression cassettes preferably 
also contain inducible elements, which function to control the transcription from the 

15 DNA encoding an interacting protein member. Other regulatory sequences such as 
transcriptional enhancer sequences and translation regulation sequences (e.g., Shine- 
Dalgarno sequence) can also be operably included in the expression cassettes. 
Termination sequences such as the polyadenylation signals from bovine growth hormone, 
SV40, lacZ and AcMNPV polyhedral protein genes may also be operably linked to the 

20 DNA encoding an interacting protein member in the expression cassettes. An epitope tag 
coding sequence for detection and/or purification of the expressed protein can also be 
operably linked to the DNA encoding an interacting protein member such that a fusion 
protein is expressed. Examples of useful epitope tags include, but are not limited to, 
influenza virus hemagglutinin (HA), Simian Virus 5 (VS), polyhistidine (6xHis), c-wyc, 

25 lacZ, OST, and the like. Proteins with polyhistidine tags can be easily detected and/or 
purified with Ni affinity columns, while specific antibodies immnnoreactive with many 
epitope tags are generally commercially available. The expression vectors may also 
contain components that direct the expressed protein extracellularly or to a particular 
intracellular compartment. Signal peptides, nuclear localization sequences, endoplasmic 

30 reticulum retention signals, mitochondrial localization sequences, myristoylatton signals, 
palmitoylation signals, and transmembrane sequences are example of optional vector 
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components that can determine the destination of expressed proteins. When it is 
desirable to express two or more interacting protein members in a single host cell, the 
DNA fragments encoding the interacting protein members may be incorporated into a 
single vector or different vectors. 

The thus constructed expression vectors can be introduced into the host cells by 
any techniques known in the art, e.g., by direct DNA transformation, microinjection, 
electroporation, viral infection, tipofection, gene gun, and the like. The expression of the 
interacting protein members may be transient or stable. The expression vectors can be 
maintained in host cells in an extrachromosomal state, i.e., as self-replicating plasmids or 
viruses. Alternatively, the expression vectors can be integrated into chromosomes of the 
host cells by conventional techniques such as selection of stable cell lines or site-specific 
recombination. In stable cell lines, at least the expression cassette portion of the 
expression vector is integrated into a chromosome of the host cells. 

The vector construct can be designed to be suitable for expression in various host 
cells, including but not limited to bacteria, yeast cells, plant cells, insect cells, and 
mammalian and human cells. Methods for preparing expression vectors for expression in 
different host cells should be apparent to a skilled artisan. 

Homologues and fragments of the native interacting protein members can also be 
easily expressed using the recombinant methods described above. For example, to 
express a protein fragment, the DNA fragment incorporated into the expression vector 
can be selected such that it only encodes the protein fragment. Likewise, a specific 
hybrid protein can be expressed using a recombinant DNA encoding the hybrid protein. 
Similarly, a homologue protein may be expressed from a DNA sequence encoding the 
homologue protein or protein fragment. A homologue-encoding DNA sequence may be 
obtained by manipulating the native protein-encoding sequence using recombinant DNA 
techniques. For this purpose, random or site-directed mutagenesis can be conducted 
using techniques generally known in the art. To make protein derivatives, for example, 
the amino acid sequence of a native interacting protein member may be changed in 
predetermined manners by site-directed DNA mutagenesis to create or remove consensus 
sequences for, e g., phosphorylation by protein kinases, giycosylation, ribosylation, 
myristolation, palmytoylation, and the like. Alternatively, non-natural amino acids can 
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be incorporated into an interacting protein member during the synthesis of the protein in 
recombinant host cells. For example, photoreactive lysine derivatives can be 
incorporated into an interacting protein member during translation by using a modified 
lysyl-tRNA. See, e.g., Wiedmann et aL t Nature, 328:830-833 (1989); Musch et al. Cell, 
69:343-352 (1992). Other photoreactive amino acid derivatives can also be incorporated 
in a similar manner. See, e.g., High et al, /. BioL Chenu, 368:2874S-28751 (1993). 
Indeed, the photoreactive amino acid derivatives thus incorporated into an interacting 
protein member can function to cross-link the protein to its interacting protein partner in a 
protein complex under predetermined conditions. 

In addition, derivatives of the native interacting protein members of the present 
invention can also be prepared by chemically linking certain moieties to amino acid side 
chains of the native proteins. 

If desired, the homologues and derivatives thus generated can be tested to 
determine whether they are capable of interacting with their intended interacting partners 
to form protein complexes. Testing can be conducted by e.g., the yeast two-hybrid 
system or other methods known in the art for detecting protein-protein interaction. 

A hybrid protein as described above having TsglOl or a homologue, derivative, 
or fragment thereof covalently linked by a peptide bond or a peptide linker to HIV GAG 
or HIV GAGp6, or a homologue, derivative, or fragment thereof, can be expressed 
recombinant^ from a chimeric nucleic acid, e.g., a DNA or mRNA fragment encoding 
the fusion protein. Accordingly, the present invention also provides a nucleic acid 
encoding the hybrid protein of the present invention. In addition, an expression vector 
having incorporated therein a nucleic acid encoding the hybrid protein of the present 
invention is also provided. The methods for making such chimeric nucleic acids and 
expression vectors containing them should be apparent to skilled artisans apprised of the 
present disclosure. 

2A. Protein Microchip 
In accordance with another embodiment of the present invention, a protein 
microchip or microarray is provided having one or more of the protein complexes and/or 
antibodies selectively immunoreactive with the protein complexes of the present 
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invention. Protein microarrays arc becoming increasingly important in both proteomics 
research and protein-based detection and diagnosis of diseases. Hie protein microarrays 
in accordance with this embodiment of the present invention will be useful in a variety of 
applications including, e.g., large-scale or high- throughput screening for compounds 
5 capable of binding to the protein complexes or modulating the interactions between the 
interacting protein members in the protein complexes. 

The protein microarray of the present invention can be prepared in a number of 
methods known in the art. An example of a suitable method is that disclosed in 
MacBeath and Schreiber, Science, 289:1760-1763 (2000). Essentially, glass microscope 

10 slides are treated with an aldehyde-containing silane reagent (Super Aldehyde Substrates 
purchased from TeleChem International, Cupertino, California). Nanoliter volumes of 
protein samples in a phophate-buffered saline with 40% glycerol are then spotted onto the 
treated slides using a high-precision contact-printing robot After incubation, the slides 
are immersed in a bovine serum albumin (BSA)-containing buffer to quench the 

15 unreacted aldehydes and to form a BSA layer which functions to prevent non-specific 
protein binding in subsequent applications of the microchip. Alternatively, as disclosed 
in MacBeath and Schreiber, proteins or protein complexes of the present invention can be 
attached to a BSA-NHS slide by covalenl linkages. BSA-NHS slides are fabricated by 
first attaching a molecular layer of BSA to the surface of glass slides and then activating 

20 the BSA with N,N'-disuccinimidyl carbonate. As a result, the amino groups of the 

lysine, asparate, and glutamate residues on the BSA are activated and can form covalent 
urea or amide linkages with protein samples spotted on the slides. See MacBeath and 
Schreiber, Science, 289:1760-1763 (2000). 

Another example of useful method for preparing the protein microchip of the 

25 present invention is that disclosed in PCT Publication Nos. WO 00/4389A2 and WO 
00/04382, both of which are assigned to Zyomyx and are incorporated herein by 
reference. First, a substrate or chip base is covered with one or more layers of thin 
organic film to eliminate any surface defects, insulate proteins from the base materials, 
and to ensure uniform protein array. Next, a plurality of protein-capturing agents (e.g., 

30 antibodies, peptides, etc.) are arrayed and attached to the base that is covered with the 
thin film. Proteins or protein complexes can then be bound to the capturing agents 
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forming a protein microarray. The protein microchips are kept in flow chambers with an 
aqueous solution. 

The protein microarray of the present invention can also be made by the method 
disclosed in PCT Publication No. WO 99/36576 assigned to Packard Bioscience 
Company, which is incorporated herein by reference. For example, a three-dimensional 
hydiophilic polymer matrix, i.e., a gel, is first disposed on a solid substrate such as a 
glass slide. The polymer matrix gel is capable of expanding or contracting and contains a 
coupling reagent that reacts with amine groups. Thus, proteins and protein complexes 
can be contacted with the matrix gel in an expanded aqueous and porous state to allow 
reactions between the amine groups on the protein or protein complexes with the 
coupling reagents thus immobilizing the proteins and protein complexes on the substrate. 
Thereafter, the gel is contracted to embed the attached proteins and protein complexes in 
the matrix gel. 

Alternatively, the proteins and protein complexes of the present invention can be 
incorporated into a commercially available protein microchip, e.g., the ProteinChip 
System from Ciphergen Biosystems Inc., Palo Alto, CA. The ProteinChip System 
comprises metal chips having a treated surface, which interact with proteins. Basically, a 
metal chip surface is coated with a silicon dioxide film. The molecules of interest such as 
proteins and protein complexes can then be attached covalently to the chip surface via a 
silane coupling agent 

The protein microchips of the present invention can also be prepared with other 
methods known in the art, e.g. f those disclosed in U.S. Patent Nos. 6,087,102, 6,139,831, 
6,087,103; PCT Publication Nos. WO 99/60156, WO 99/39210, WO 00/54046, WO 
00/53625, WO 99/51773, WO 99/35289, WO 97/42507, WO 01/01142, WO 00/63694, 
WO 00/61806, WO 99/61148, WO 99/40434, all of which are incorporated herein by 
reference. 

3. Antibodies 

In accordance with another aspect of the present invention, an antibody 
immunoreactive against a protein complex of the present invention is provided, hi one 
embodiment, the antibody is selectively immunoreactive with a protein complex of the 
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present invention. Specifically, the phrase "selectively immun ore active with a protein 
complex" as used herein means that the immunoreactivity of the antibody of the present 
invention with the protein complex is substantially higher than that with the individual 
interacting members of the protein complex so that the binding of the antibody to die 
protein complex is readily distinguishable from the binding of die antibody to the 
individual interacting member proteins based on the strength of the binding affinities. 
Preferably, the binding constant differs by a magnitude of at least 2 fold, more preferably 
at least 5 fold, even more preferably at least 10 fold, and most preferably at least 100 
fold. In a specific embodiment, the antibody is not substantially immunoreactive with the 
interacting protein members of the protein complex. 

The antibody of the present invention can be readily prepared using procedures 
generally known in the art See, e.g. 9 Harlow and Lane, Antibodies: A Laboratory 
Manual, Cold Spring Harbor Press, 1988. Typically, the protein complex against which 
the antibody to be generated will be immunoreactive is used as die antigen for the 
purpose of producing immune response in a host animal. In one embodiment, die protein 
complex used consists the native proteins. Preferably, the protein complex includes only 
the binding domains of TsglOl and HIV GAGp6, respectively. As a result, a greater 
portion of die total antibodies may be selectively immunoreactive with the protein 
complexes. The binding domains can be selected from, e.g., those summarized in Table 
1. In addition, various techniques known in the art for predicting epitopes may also be 
employed to design antigenic peptides based on the interacting protein members in a 
protein complex of die present invention to increase the possibility of producing an 
antibody selectively immunoreactive with the protein complex. Suitable epitope- 
prediction computer programs include, e.g., Mac Vector from International 
Biotechnologies, Inc. and Protean from DNAStar. 

In a specific embodiment, a hybrid protein as described above in Section 2 is used 
as an antigen which has TsglOl or a horaologues, derivative, or fragment thereof 
covalently linked by a peptide bond or a peptide linker to HIV GAG or HTV GAGp6 or a 
homologue, derivative, or fragment thereof. En a preferred embodiment, the hybrid 
protein consists of two interacting binding domains selected from Table 1, or homologues 
or derivatives thereof, covalently linked together by a peptide bond or a linker molecule. 
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The antibody of the present invention can be a polyclonal antibody to a protein 
complex of the present invention. To produce the polyclonal antibody, various animal 
hosts can be employed, including, e.g,, mice, rats, rabbits, goats, guinea pigs, hamsters, 
etc. A suitable antigen which is a protein complex of the present invention or a 
5 derivative thereof as described above can be administered directly to a host animal to 
illicit immune reactions. Alternatively, it can be- administered together with a carrier 
such as keyhole limpet hemocyanin (KLH), bovine serum albumin (BSA), ovalbumin, 
and Tetanus toxoid. Optionally, the antigen is conjugated to a earner by a coupling agent 
such as carbodiimide, glutaraldehyde, and MBS. Any conventional adjuvants may be 

10 used to boost the immune response of the host animal to the protein complex antigen. 
Suitable adjuvants known in the art include but are not limited to Complete Freund 4 s 
Adjuvant (which contains killed mycobacterial cells and mineral oil), incomplete 
Freund's Adjuvant (which lacks the cellular components), al uminum salts, MF59 from 
Biocine, monophospholipid, synthetic trehalose dicorynomycolate (TDM) and cell wall 

15 skeleton (CWS) both from RIBI ImmunoChem Research Inc., Hamilton, MT, non-ionic 
surfactant vesicles (NISV) from Proteus International PLC, Cheshire, ILK,, and saponins. 
The antigen preparation can be administered to a host animal by subcutaneous, 
intramuscular, intravenous, intradermal, or intraperitoneal injection, or by injection into a 
lymphoid organ. 

20 The antibodies of the present invention may also be monoclonal. Such 

monoclonal antibodies may be developed using any conventional techniques known in 
the art For example, the popular hybridoma method disclosed in Kohler and Milstein, 
Nature, 256:495-497 (1975) is now a well-developed technique that can be used in the 
present invention. See U.S. Patent No. 4,376,1 10, which is incorporated herein by 

25 reference. Essentially, B-lymphocytes producing a polyclonal antibody against a protein 
complex of the present invention can be fused with myeloma cells to generate a library of 
hybridoma clones. The hybridoma population is then screened for antigen binding 
specificity and also for immunoglobulin class (isotype). In this manner, pure hybridoma 
clones producing specific homogenous antibodies can be selected. See generally, Harlow 

30 and Lane, Antibodies; A Laboratory Manual, Cold Spring Harbor Press, 1988. 

Alternatively, other techniques known in the art may also be used to prepare monoclonal 
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antibodies, which include bnt are not limited to the EB V hybridoma technique, the 
human N-cell hybridoma technique, and the trioma technique. 

In addition, antibodies selectively iznmunoreactive with a protein complex of the 
present invention may also be recombinantly produced For example, cDNAs prepared 
5 by PCR amplification from activated B -lymphocytes or hybridomas may be cloned into 
an expression vector to form a cDNA library, which is then introduced into a host cell far 
recombinant expression. The cDNA encoding a specific desired protein may then be 
isolated from the library. The isolated cDNA can be introduced into a suitable host cell 
for the expression of the protein. Thus, recombinant techniques can be used to 

1 0 recombinantly produce specific native antibodies, hyhrid antibodies capable of 

simultaneous reaction with more than one antigen, chimeric antibodies (e.g., the constant 
and variable regions are derived from different sources), univalent antibodies which 
comprise one heavy and light chain pair coupled with the ft region of a third (heavy) 
chain, Fab proteins, and the like. See U.S. Patent No. 4,816,567; European Patent 

15 Publication No. 0088994; Munro, Nature, 312:597 (1984); Morrison, Science, 229:1202 
(1985); Oi et aL, BioTechniques, 4:214 (1986); and Wood et al., Nature, 314:446-449 
(1985), all of which are incorporated herein by reference. Antibody fragments such as Fv 
fragments, single-chain Fv fragments (scPv), Fab 1 fragments, and F(ab , )2 fragments can 
also be recombinantly produced by methods disclosed in, e.g., U.S. Patent No, 4,946,778; 

20 Skerra & Flucklfaiin, Science, 240:1038-1041(1988); Better et aL, Science, 240:1041- 

1043 (1988); and Bird, et aL, Science, 242:423-426 (1988), all of which are incorporated 
herein by reference. 

In a preferred embodiment, the antibodies provided in accordance with the present 
invention are partially or folly humanized antibodies. For this purpose, any methods 

25 known in the art may be used. For example, partially humanized chimeric antibodies 
having V regions derived from the tumor-specific mouse monoclonal antibody, but 
human C regions are disclosed in Morrison and Oi, Adv. Immunol, 44:65-92 (1989). In 
addition, fully humanized antibodies can be made using transgenic non-human animals. 
For example, transgenic non-human animals such as transgenic mice can be produced in 

30 which endogenous immunoglobulin genes are suppressed or deleted, while heterologous 
antibodies are encoded entirely by exogenous immunoglobulin genes, preferably human 
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immunoglobulin genes, recombinantly introduced into the genome. See e.g mf U.S. Patent 
Nos. 5,530,101; 5,545,806; 6,075,181; PCT Publication No, WO 94/02602; Green et. aL, 
Nat. Genetics, 7: 13-21 (1994); and Lonberg etal., Nature 368: 856-859 (1994), all of 
which are incorporated herein by reference. The transgenic non-human host animal may 
5 be immunized with suitable antigens such as a protein complex of the present invention 
or one or more of the interacting protein members thereof to illicit specific immune 
response thus producing humanized antibodies. In addition, cell lines producing specific 
humanized antibodies can also be derived from the immunized transgenic non-human 
animals. For example, mature B-lymphocytes obtained from a transgenic animal 

1 0 producing humanized antibodies can be fused to myeloma cells and the resulting 
hybridoma clones may be selected for specific humanized antibodies with desired 
binding specificities. Alternatively, cDNAs may be extracted from mature B- 
lymphocytes and used in establishing a library which is subsequendy screened for clones 
encoding humanized antibodies with desired binding specificities. 

15 In yet another embodiment, a bifunctional antibody is provided which has two 

different antigen binding sites, each being specific to a different interacting protein 
member in a protein complex of the present invention. The bifunctional antibody may be 
produced using a variety of methods known in the art For example, two different 
monoclonal antibody-producing hybridomas can be fused together. One of the two 

20 hybridomas may produce a monoclonal antibody specific against an interacting protein 
member of a protein complex of the present invention, while the other hybridoma 
generates a monoclonal antibody immunoreactive with another interacting protein 
member of the protein complex. The thus formed new hybridoma produces different 
antibodies including a desired bifunctional antibody, i.e., an antibody immunoreactive 

25 with both of the interacting protein members. The bifunctional antibody can be readily 
purified. See Milstein and Cuello, Nature, 305:537-540 (1983), 

Alternatively, a bifunctional antibody may also be produced using 
heterobifunctional crosslinkers to chemically link two different monoclonal antibodies, 
each being immunoreactive with a different interacting protein member of a protein 

30 complex. Therefore, the aggregate will bind to two interacting protein members of the 
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protein complex. See Staerz et al, Nature, 314:628-631(1985); Perez et aL Nature, 

316:354-356 (1985), 

In addition, biftmctional antibodies can also be produced by recombinantly 

expressing light and heavy chain genes in a hybridoma that itself produces a monoclonal 
5 antibody. As a result, a mixture of antibodies including a Afunctional antibody is 

produced See DeMonte etal, Proc. Nad. Acad. Sci. t USA, 87:2941-2945 (1990); Lenz 

and Weidle, Gene, 87:213-218 (1990). 

Preferably, a biftmctional antibody in accordance with the present invention is 

produced by the method disclosed in U.S. Patent No. 5,582,996, which is incorporated 
10 herein by reference. For example* two different Fabs can be provided and mixed 

together. The first Fab can bind to an interacting protein member of a protein complex, 

and has a heavy chain constant region having a first complementary domain not naturally 

present in the Fab but capable of binding a second complementary domain. The second 

Fab is capable of binding another interacting protein member of the protein complex, and 
1 5 has a heavy chain constant region comprising a second complementary domain not 

naturally present in the Fab but capable of binding to the first complementary domain. 

Each of the two complementary domains is capable of stably binding to the other but not 

to itself. For example, the leucine zipper regions of c-fos and c-jun oncogenes may be 

used as die first and second complementary domains. As a result, the first and second 
20 complementary domains interact with each other to form a leucine zipper thus associating 

the two different Fabs into a single antibody construct capable of binding to two antigenic 

sites. 

Other suitable methods known in the art for producing biftmctional antibodies 
may also be used, which include those disclosed in Holliger et aL, Proc, Nat'lAcad. Set 
25 USA 90:6444-6448 (1993); de Kruif etaL, J. Biol Chem., 271:7630-7634 (1996); 

Coloma and Morrison, Nat Biotechnol, 15:159-163 (1997); Muller etal., FEBS Lett., 
422:259-264 (1998); and Muller et at., FEBS Lett., 432:45-49 (1998), all of which are 
incorporated herein by reference. 

30 
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4. Screening Assays 
Hie protein complexes of the present invention, TsglOl and HIV GAG or HIV 
GAGp6 can be used in screening assays to select modulators of TsglOl, HTV GAGp6, 
and protean complexes of the present invention. In addition, homologues, derivatives and 
5 fragments of TsglOl, HTV GAG, HTV GAGp6, and protein complexes containing such 
homologies, derivatives and fragments may also be used in the screening assays. As 
used herein, the term "modulator" encompasses any compounds that can cause any forms 
of alteration of the properties, biological activities or functions of the proteins or protein 
complexes, including, e.g., enhancing or reducing their biological activities, increasing or 

10 decreasing their stability, altering their affinity or specificity to certain other biological 
molecules, etc. In addition, the term "modulator" as used herein also includes any 
compounds that simply bind TsglOl , HTV GAGp6, and/or the proteins complexes of the 
present invention. For example, a modulator can be a an interaction antagonist capable of 
interfering with, or disrupting or dissociating protein-protein interaction between TsglOl 

IS or a homologue or derivative thereof and HTV GAGp6 or a homologue or derivative 
thereof. 

Likewise, other retroviral GAG polypeptides containing the P(T/S)AP late 
domain motif, or homologues, derivatives or fragments thereof may also be used in the 
screening assays in lieu of HIV GAG or HTV GAGp6, In other words, any interacting 

20 members of the protein complexes provided according to the present invention (e.g., as 
described in Section 2) can be used in the screening assays. 

The term "interaction antagonist" as used herein means a compound that 
interferes with, blocks, disrupts or destabilizes a protein-protein interaction; blocks or 
interferes with the formation of a protein complex; or destabilizes, disrupts or dissociates 

25 an existing protein complex. 

The term "interaction agonist" as used herein means a compound that triggers, 
initiates, propagates, nucleates, or otherwise enhances the formation of a protein-protein 
interaction; triggers, initiates, propagates, nucleates, or otherwise enhances the formation 
of a protein complex; or stabilizes an existing protein complex. 

30 Accordingly, the present invention provides screening methods for selecting 

modulators of TsglOl or HIV GAGp6 or a mutant form thereof, or a protein complex 
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formed between TsglOl or a homologue or derivative or fragment thereof and HIV GAG 
or HIV GAGp6 or a homologue or derivative or fragment thereof. The targets suitable in 
the screening methods of the present invention may include any embodiments of the 
protein complexes of the present invention as described in Section 2. Preferably, protein 
5 fragments are used in forming the protein complexes. For example, a preferred target 
protein complex can include a TsglOl protein fragment encompassing the UEV domain. 
Also for example, the HIV GAGp6 or a fragment thereof may be used in forming a target 
protein complex. In a specific embodiment, a polypeptide including the first 14 amino 
acids of the HTV GAGp6 is used in forming a target protein complex. Hi another 

10 embodiment, fusion proteins are used in which a detectable epitope tag is fused to a 

TsglOl protein or a homologue or derivative or fragment thereof and/or to a HTV GAGp6 
polypeptide or a homologue or derivative or fragment thereof. Suitable examples of such 
epitope tags include sequences derived from* e.g., influenza virus hemagglutinin (HA), 
Simian Virus 5 (VS), polyhistidine (6xHis), c-myc t lacZ, GST, and the like, 

IS When TsglOl protein or a homologue or derivative or fragment thereof is used as 

a target protein in the screening methods of the present invention, preferably the TsglOl 
UEV domain is included in the TsglOl protein or a homologue or derivative or fragment 
thereof. And preferably the TsglOl protein or a homologue or derivative or fragment 
thereof is fused to a detectable tag such as sequences derived from, e.g., influenza virus 

20 hemagglutinin (HA), Simian Virus 5 (V5), polyhistidine (6xHSs), c-myc, lacZ, GST, and 
the like. In this respect, compounds selected by the methods capable of binding to 
TsglOl protein, preferably the UEV domain of TsglOl protein can be tested for their 
ability to inhibit or interfere with the interactions between TsglOl and HTV GAGp6. 
They can also be tested for their ability to inhibit HIV viral budding or HTV propagation. 

25 Suitable methods for such testing should be apparent to skilled artisan apprised of the 
present disclosure. 

The modulators selected in accordance with the screening methods of the present 
invention can be effective in modulating die functions or activities of TsglOl, HIV GAG, 
HTV GAGp6, other retroviral GAG polypeptides containing the P(T/S)AP motif, or the 
30 protein complexes of the present invention. For example, compounds capable of binding 
the protein complexes may be capable of modulating the functions of the protein 
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complexes. Additionally, compounds that interfere with, weaken, dissociate or disrupt, 
or alternatively, initiate, facilitate or stabilize the protein-protein interaction between the 
interacting protein members of the protein complexes can also be effective in modulating 
the functions or activities of the protein complexes. Thus, the compounds identified in 
5 the screening methods of the present invention can be made into therapeutically or 
prophylactically effective drugs for preventing or ameliorating diseases, disorders or 
symptoms caused by or associated with the protein complexes or TsglOl or HIV GAG or 
HIV GAGp6 or other retroviral GAG polypeptides. Alternatively, they may be used as 
leads to aid the design and identification of therapeutically or prophylactically effective 

10 compounds for diseases, disorders or symptoms caused by or associated with the protein 
complexes of the present invention or interacting members thereof. The protein 
complexes and/or interacting protein members thereof in accordance with the present 
invention can be used m any of a variety of drug screening techniques. Drug screening 
can be performed as described herein or using well-known techniques, such as those 

15 described in U.S. Patent Nos. 5,800,998 and 5,891,628, both of which are incorporated 
herein by reference. 

4*1. Test Compounds 
Any test compounds may be screened in the screening assays of the present 

20 invention to select modulators of a TsglOl-containing protein complex of the present 
invention or interacting members thereof . By the term "selecting" or "select" 
modulators it is intended to encompass both (a) choosing compounds from a group 
previously unknown to be modulators of a TsglOl-containing protein complex of the 
present invention and/or an interacting member thereof, and (b) testing compounds that 

25 are known to be capable of binding, or modulating the functions and activities of, a 
TsglOl-containing protein complex of the present invention and/or an interacting 
member thereof. Both types of compounds are generally referred to herein as "test 
compounds." The test compounds may include, by way of example, proteins (e.g., 
antibodies, small peptides, artificial or natural proteins), nucleic adds, and derivatives, 

30 mim etics and analogs thereof, and small organic molecules having a molecular weight of 
no greater than 10,000 daltons, more preferably less than 5,000 daltons. Preferably, the 
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test compounds are provided in library formats known in the art, e.g., in chemically 
synthesized libraries, recombinant expression libraries (e.g., phage display libraries), and 
in vitro translation-based libraries (e.g., ribosome display libraries). 

For example, the screening assays of the present invention can be used in the 
5 antibody production processes described in Section 3 to select antibodies with desirable 
specificities. Various forms antibodies or derivatives thereof may be screened, including 
but not limited to, polyclonal antibodies, monoclonal antibodies, bifunctional antibodies, 
chimeric antibodies, single chain antibodies, antibody fragments such as Fv fragments, 
single-chain Fv fragments (scFv), Fab' fragments, and F(ab'>2 fragments, and various 

10 modified forms of antibodies such as catalytic antibodies, and antibodies conjugated to 
toxins or drugs, and the like. The antibodies can be of any types such as IgG, IgE, IgA, 
or IgM. Humanized antibodies are particularly preferred. Preferably, the various 
antibodies and antibody fragments may be provided in libraries to allow large-scale high 
throughput screening. For example, expression libraries expressing antibodies ox 

15 antibody fragments may be constructed by a method disclosed, e.g., in Huse et aL t 

Science, 246:1275-1281 (1989), which is incorporated herein by reference. Single-chain 
Fv (scFv) antibodies are of particular interest in diagnostic and therapeutic applications. 
Methods for providing antibody libraries are also provided in U.S. Patent Nos. 6,096^51; 
5,844,093; 5,837,460; 5,789,208; and 5,667,988, all of which are incorporated herein by 

20 reference. 

Peptidic test compounds may be peptides having L- amino acids and/or D- amino 
acids, phosphopeptides, and other types of peptides* The screened peptides can be of any 
size, but preferably have less than about 50 amino acids. Smaller peptides are easier to 
deliver into a patient's body. Various forms of modified peptides may also be screened. 

25 like antibodies, peptides can also be provided in, e.g., combinatorial libraries. See 
generally, Gallop et al„ /, Med. Ckem., 37:1233-1251 (1994). Methods for making 
random peptide libraries are disclosed in, e.g., Devlin et al, Science, 249:404406 (1990). 
Other suitable methods for constructing peptide libraries and screening peptides 
therefrom are disclosed in, e.g., Scott and Smith, Science, 249:386-390 (1990); Moran et 

30 aU /. Am. Chem. Soc., 1 17:10787-10788 (1995) (a library of electronically tagged 
synthetic peptides); Stachelhaus etaL, Science, 269:69-72 (1995); U.S. Patent Nos. 
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6,156,511; 6,107,059; 6,015,561; 5,750,344; 5,834,318; 5,750,344, aU of which are 
incorporated herein by reference. For example, random-sequence peptide phage display 
libraries may be generated by cloning synthetic oligonucleotides into the gene H or gene 
Vm of an £. colt filamentous phage. The thus generated phage can propagate in E. colt 
5 and express peptides encoded by the oligonucleotides as fusion proteins on die surface of 
the phage. Scott and Smith, Science, 249:368-390 (1990). Alternatively, the "peptides 
on plasmids" method may also be used to form peptide libraries. In this method, random 
peptides may be fused to the C-terminus of the & colL Lac repressor by recombinant 
technologies and expressed from a plasznid that also contains Lac repressor-binding sites. 

10 As a result, the peptide fusions bind to the same plasmid that encodes them. 

Small organic or inorganic non-peptide non-nucleotide compounds having a 
molecular weight of less than 5,000 daltons ate preferred test compounds for the 
screening assays of the present invention. They too can be provided in a library format 
See generally, Gordan et aL J. Med. Clxem., 37:1385-1401 (1994). For example, 

15 benzodiazepine libraries are provided in Bunin and Tollman, J. Am* Cherru 

Soc, 114: 10997-10998 (1992), which is incorporated herein by reference. A method for 
constructing and screening peptoid libraries are disclosed in Simon et at, Proc. Natl 
Acad. ScL USA, 89:9367-9371 (1992). Methods for the biosynthesis of novel polyketides 
in a library format are described in McDaniel et al, Science, 262:1546-1550 (1993) and 

20 Kao et at, Science, 265:509-512 (1994). Various libraries of small organic molecules 
and methods of construction thereof are disclosed in U.S. Patent Nos. 6,162,926 
(multiply-substituted fullerene derivatives); 6,093,798 (hydroxamic acid derivatives); 
5,962,337 (combinatorial l,4-benzodiazepin-2, 5-dione library); 5,877,278 (Synthesis of 
N-substituted oligomers); 5,866,341 (compositions and methods for screening drug 

25 libraries); 5,792,821 (polymerizable cyclodextrin derivatives); 5,766,963 

(hydroxypropylamine library); and 5,698,685 (morpholino-subunit combinatorial 
library), all of which are incorporated herein by reference. 

Other compounds such as oligonucleotides and peptide nucleic acids (FNA), and 
analogs and derivatives thereof may also be screened to select clinically useful 

30 compounds. Combinatorial libraries of oligos are also known in die art See Gold et at, 
/. Biot Chenu, 270:13581-13584(1995). 
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4.Z In vitro Screening Assays 
The test compounds may be screened in an in vitro assay to select compounds 
capable of binding the protein complexes or interacting protein members thereof in 
5 accordance with the present invention. Far this purpose, a test compound is contacted 
with a protein complex or an interacting protein member thereof under conditions and for 
a time sufficient to allow specific interaction between Hie test compound and the target 
components to occur and thus binding of the compound to the target forming a complex. 
Subsequently, the binding event is detected. 

10 Various screening techniques known in the art may be used in the present 

invention. The protein complexes and the interacting protein members thereof may be 
prepared by any suitable methods, e.g., by recombinant expression and purification. Hie 
protein complexes and/or interacting protein members thereof (both are referred to as 
"target" hereinafter in this section) may be free in solution or in cell extracts. A test 

15 compound may be mixed with a target forming a liquid mixture. The compound may be 
labeled with a detectable marker. Upon mixing under suitable conditions, the binding 
complex having the compound and die target may be co-immunqprecipitated and washed. 
The compound in the precipitated complex may be detected based on the matter on the 
compound. 

20 In a preferred embodiment, the target is immobilized on a solid support or on a 

cell surface. Preferably, the target can be arrayed into a protein microchip in a method 
described m Section 2. For example, a target may be immobilized directly onto a 
microchip substrate such as glass slides or onto a multi-well plates using non-neutralizing 
antibodies, i.e., antibodies capable of binding to the target but do not substantially affect 

25 its biological activities. To effect the screening, test compounds can be contacted with 
the immobilized target to allow binding to occur to form complexes under standard 
binding assay conditions. Either the targets or test compounds are labeled with a 
detectable marker using well-known labeling techniques. For example, U.S. Patent No. 
5,741,713 discloses combinatorial libraries of biochemical compounds labeled with NMR 

30 active isotopes. To select binding compounds, one may measure the formation of the 
target-test compound complexes or kinetics for the formation thereof. When 
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combinatorial libraries of organic non-peptide non-nucleic acid compound are screened, 
it is preferred that labeled or encoded (or "tagged") combinatorial libraries are used to 
allow rapid decoding of lead structures. This is especially important because, unlike 
biological libraries, individual compounds found in chemical libraries cannot be 
5 amplified by self-amplification* Tagged combinatorial libraries are provided in, e.g., 
Borchardt and Still, J. Am. Chem. Soc., 116:373-374 (1994) and Moran et aL, J. Am. 
Chem. Soc., 117:10787-10788 (1995), both of which are incorporated herein by 
reference. 

Alternatively, the test compounds can be immobilized on a solid support, e.g., 

10 forming a mciroarray of test compounds. Hie target protein or protein complex is then 
contacted with the test compounds. The target may be labeled with any suitable detection 
marker. For example, the target may be labeled with radioactive isotopes or fluorescence 
marker before binding reaction occurs. Alternatively, after the binding reactions, 
antibodies that arc immunoreactive with the target and are labeled with radioactive 

IS materials, fluorescence markers, enzymes, or labeled secondary anti-Ig antibodies may be 
used to detect any bound target thus selecting the binding compound. One example of 
this embodiment is the protein probing method. That is, the target provided in 
accordance with the present invention is used as a probe to screen expression libraries of 
proteins or random peptides. The expression libraries can be phage display libraries, in 

20 vitro translation-based libraries, or ordinary expression cDNA libraries. The libraries 

may be immobilized on a solid support such as nitrocellulose filters. See e.g., Sikela and 
Hahn, Proc. Nad. Acad. ScL USA, 84:3038-3042 (1987)- The probe may be labeled by a 
radioactive isotope or a fluorescence marker. Alternatively, the probe can be biotinylated 
and detected with a streptavidin-alkaline phosphatase conjugate. More conveniently, the 

25 bound probe may be detected with an antibody. 

In yet another embodiment, a known Iigand capable of binding to the target can 
be used in competitive binding assays. Complexes between the known ligand and the 
target can be formed and then contacted with test compounds. The ability of a test 
compound to interfere with die interaction between the target and the known ligand is 

30 measured. One exemplary ligand is an antibody capable of specifically binding the 
target Particularly, such an antibody is especially useful for identifying peptides that 
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share one or more antigenic determinants of the target protein complex or interacting 
protein members thereof. 

In a specific embodiment, a protein complex used in the screening assay includes 
a hybrid protein as described in Section 2, which is formed by fusion of two interacting 
protein members or fragments or domains thereof. The hybrid protein may also be 
designed such that it contains a detectable epitope tag fused thereto. Suitable examples 
of such epitope tags include sequences derived from, e.g., influenza virus hemagglutinin 
(HA), Simian Virus 5 (V5), polyhistidine (6xHis), c-myc f lacZ, GST, and the like. 

Test compounds may also be screened in an in vitro assay to select interaction 
antagonists of the protein complexes identified in accordance with the present invention. 
Thus, for example, a TsglOl-HIV OAGp6 protein complex can be contacted with a test 
compound and disruption or destabilization of the protein complex can be detected. 

The assay can be conducted in similar manners as the binding assays described 
above. For example, the presence or absence of a particular protein complex can be 
detected by an antibody selectively immunoreactive with die protein complex. Thus, 
after incubation of die protein complex with a test compound, immunoprecipitation assay 
can be conducted with the antibody. If the test compound disrupts the protein complex, 
then die amount of immunoprecipitated protein complex in this assay will be significantly 
less than that in a control assay in which the same protein complex is not contacted with 
the test compound. Various other detection methods may be suitable in the dissociation 
assay, as will be apparent to skilled artisan apprised of the present disclosure. In one 
embodiment, one of the interacting partner with a detectable marker fused thereto is fixed 
to a solid support For example, a GST-GAGp6 fusion protein is attached to a solid 
support Then the other interacting partner with a detectable marker fused thereto (e.g., a 
myc-tagged TsglOl fragment containing the UEV domain) is contacted with the 
immobilized first interacting partner in the presence of one or more test compounds. If 
binding between the two interacting partners occurs, the myc-tagged TsglOl fragment is 
also immobilized, which can be detected using an anti-myc antibody after the binding 
reaction mixture is washed to remove unbound myc-tagged TsglOl fragment 

4,3. In vivo Screening Assays 
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Test compounds can also be screened in any in vivo assays to select modulators of 
the protein complexes or interacting protein members thereof in accordance with the 
present invention. For example, any in vivo assays known in the ait useful in selecting 
compounds capable of strengthening or interfering with the stability of the protein 
complexes of the present invention may be used. 

4,3.1. Two-Hybrid Assays 
In a preferred embodiment, one of the yeast two-hybrid systems or their 
analogous or derivative forms is used. Examples of suitable two-hybrid systems known 
in the art include, but are not limited to, those disclosed in U.S. Patent Nos. 5,283,173; 
5,525,490; 5,585,245; 5,637,463; 5,695,941; 5,733,726; 5,776,689; 5,885,779; 5,905,025; 
6,037,136; 6,057,101; 6,114,111; and Bartel and Relds, eds., The Yeast Two-Hybrid 
System, Oxford University Press, New York, NY, 1997, all of which are incorporated 
herein by reference. 

Typically, in a classic transcription-based two-hybrid assay, two chimeric genes 
are prepared encoding two fusion proteins: one contains a transcription activation domain 
fused to an interacting protein member of a protein complex of the present invention or 
an interacting domain of the interacting protein member, while the other fusion protein 
includes a DNA binding domain fused to another interacting protein member of the 
protein complex or an interacting domain thereof. For the purpose of convenience, the 
two interacting protein members or interacting domains thereof are referred to as "bait 
fusion protein" and 4< prey fusion protein," respectively. The chimeric genes encoding the 
fusion proteins are termed "bait chimeric gene" and '*prey chimeric gene," respectively. 
Typically, a "bait vector" and a "prey vector" are provided for the expression of a bait 
chimeric gene and a prey chimeric gene, respectively. 

43.1.1. Vectors 

Many types of vectors can be used in a transcription-based two-hybrid assay. 
Methods for the construction of bait vectors and prey vectors should be apparent to 
skilled artisans in die art apprised of the present disclosure. See generally, Current 
Protocols in Molecular Biology, Vol. 2, Ed, Ausubel, et a/., Greene Publish. Assoc. & 
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Wiley Interscience, Ch. 13, 1988; Glover, DNA Cloning, Vol. II, 1RL Press, Wash., D.C., 
Oh 3, 1986; Bitter, et aL, in Methods in Enzymology 153:516-544 (1987); The Molecular 
Biology of the Yeast Saccharomyces, Eds. Strathem et al„ Cold Spring Harbor Press, 
Vols. I and II, 1982; and Rothstein in DNA Cloning: A Practical Approach, Vol. 11, Ed 
5 DM Glover, IRL Press, Wash., D.C., 1986. 

Generally, the bait and prey vectors include an expression cassette having a 
promoter operably linked to a chimeric gene for die transcription of the chimeric gene. 
The vectors may also include an origin of DNA replication for the replication of the 
vectors in host cells and a replication origin for the amplification of the vectors in, e.g., E. 

10 coli> and selection markers) for selecting and maintaining only those host cells harboring 
the vectors. Additionally, the expression cassette preferably also contains inducible 
elements, which function to control the expression of a chimeric gene. Making the 
expression of the chimeric genes inducible and controllable is especially important in the 
event that the fusion proteins or components thereof are toxic to the host cells. Other 

15 regulatory sequences such as transcriptional enhancer sequences and translation 
regulation sequences (e.g., Shine-Dalgarno sequence) can also be included in the 
expression cassette. Termination sequences such as the bovine growth hormone, SV40, 
lacZ and AcMNPV polyhedral polyadenylation signals may also be operably linked to a 
chimeric gene in the expression cassette. An epitope tag coding sequence for detection 

20 and/or purification of the fusion proteins can also be operably linked to the chimeric gene 
in the expression cassette. Examples of useful epitope tags include, but are not limited to, 
influenza virus hemagglutinin (HA), Simian Virus 5 (V5), polyhistidine (6xHSs), c-iwyc, 
lacZ, GST, and the like. Proteins with polyhistidine tags can be easily detected and/or 
purified with Ni affinity columns, while specific antibodies to many epitope tags are 

25 generally commercially available. The vectors can be introduced into the host cells by 
any techniques known in the art, e.g., by direct DNA transformation, microinjection, 
electroporation, viral infection, lipofection, gene gun, and the like. The bait and prey 
vectors can be maintained in host cells in an extrachromosomal state, i.e., as self- 
replicating plasmids or viruses. Alternatively, one or both vectors can be integrated into 

30 chromosomes of the host cells by conventional techniques such as selection of stable cell 
lines or site-specific recombination. 
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Hie in vivo assays of the present invention can be conducted in many different 
host cells, including but not limited to bacteria, yeast cells, plant cells, insect cells, and 
mammalian cells. A skilled artisan will recognize that the designs of the vectors can vary 
with the host cells used. In one embodiment, the assay is conducted in prokaryotic cells 
5 such as Escherichia coU % Salmonella, Klebsiella, Pseudomonas, Caulobacter, and 
Rhizobium. Suitable origins of replication for the expression vectors useful in tins 
embodiment of the present invention include, e.g., the ColEl, pSClOl, and M13 origins 
of replication. Examples of suitable promoters include, for example, the T7 promoter, 
the lacZ promoter, and the like. In addition, inducible promoters are also useful in 

10 modulating the expression of the chimeric genes. For example, the lac operon from 
bacteriophage lambda plac5 is well known in the art and is inducible by the addition of 
IPTG to die growth medium. Other known inducible promoters useful in a bacteria 
expression system include pL of bacteriophage X> the trp promoter, and hybrid promoters 
such as the tac promoter, and the like. 

IS In addition, selection marker sequences for selecting and maintaining only those 

cells expressing the desirable fusion proteins should also be incorporated into the 
expression vectors. Numerous selection markers including auxotrophic markers and 
antibiotic resistance markers are known in the art and can all be useful for purposes of 
this invention. For example, the bla gene which confers ampicillin resistance is the most 

20 commonly used selection marker in prokaryotic expression vectors. Other suitable 

markers include genes that confer neomycin, kanamycra, or hygromycin resistance to the 
host cells. In fact, many vectors are commercially available from vendors such as 
Invitrogen Corp. of San Diego, Calif,, Clontech Corp. of Palo Alto, Calif., BRL of 
Bethesda, Maryland, and Promcga Corp. of Madison, Wiscon. These commercially 

25 available vectors, e.g., pBR322, pSPORT, pBluescriptDSK, pcDNAI, and pcDNAH all 
have a multiple cloning site into which the chimeric genes of the present invention can be 
conveniently inserted using conventional recombinant techniques. The constructed 
expression vectors can be introduced into host cells by various transformation or 
transfection techniques generally known in the ait. 

30 In another embodiment, mammalian cells are used as host cells for the expression 

of the fusion proteins and detection of protein-protein interactions. For this purpose, 
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virtually any mammalian cells can be used including normal tissue cells, stable cell lines, 
and transformed tumor cells. Conveniently, TnanxmnliaTi cell lines such as CHO cells, 
Juricat T cells, NTH 3T3 cells, HEK-293 cells, CV-l cells, COS-1 cells, HeLa cells, 
VERO cells, MDCK cells, WI38 cells, and the like are used. Mammalian expression 
5 vectors are well known in the ait and many are commercially available. Examples of 
suitable promoters for the transcription of the chimeric genes in mammalian cells include 
viral transcription promoters derived from adenovirus, simian virus 40 (SV40) (e.g., the 
early and late promoters of SV40), Rous sarcoma virus (RSV), and cytomegalovirus 
(CMV) (e.g., CMV immediate-early promoter), human immunodeficiency virus (HIV) 

10 (e.g., long terminal repeat (LTR)), vaccinia virus (e.g., 7.5K promoter), and herpes 

simplex virus (HSV) (e.g., thymidine kinase promoter). Inducible promoters can also be 
used Suitable inducible promoters include, for example, the tetracycline responsive 
element (TRE) (See Gossen et aL> Proc. Natl Acad. ScL USA, 89:5547-5551 (1992)), 
metallothionein DA promoter, ecdysone-responsive promoter, and heat shock promoters. 

15 Suitable origins of replication far the replication and maintenance of the expression 
vectors in mammalian cells include, e.g., the Epstein Batr origin of replication in the 
presence of the Epstein Barr nuclear antigen (see Sugden et of., Mole. Cell Biol, 5:410- 
413 (1985)) and the SV40 origin of replication in the presence of the SV40 T antigen 
(which is present in COS-1 and COS-7 cells) (see Margolskee etal., Mole. Cell Biol, 

20 8:2837 (1988)). Suitable selection markers include, but are not limited to, genes 
conferring resistance to neomycin, hygromycin, zeocin, and the like. Many 
commercially available mammalian expression vectors may be useful for the present 
invention, including, e.g., pCEP4, pcDNAI, pIND, pSecTag2, pVAXl, pcDNA3.1, and 
pBI-EGFP, and pDisplay. The vectors can be introduced into mammfliigti cells using any 

25 known techniques such as calcium phosphate precipitation, lipofection, electroporation, 
and the like. The bait vector and prey vector can be co-transformed into the same cell or, 
alternatively, introduced into two different cells which are subsequently fused together by 
cell fusion or other suitable techniques. 

Viral expression vectors, which permit introduction of recombinant genes into 

30 ceUs by viral infection, can also be used for the expression of the fusion protons. Viral 
expression vectors generally known in the art include viral vectors based on adenovirus, 
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bovine papilloma virus, murine stem cell vims (MSCV), MFG virus, and retrovirus. See 
Sarver, et al, Mol Cell Biol., 1: 486 (1981); Logan & Shenk, Proc. Natl Acad. ScL 
USA, 81:3655-3659 (1984); Mackett, et al., Proc. Natl. Acad. ScL USA, 79:7415-7419 . 
(1982); Mackett, et aL, J. Virol, 49:857-864 (1984); Pamcali, et al, Proc. Natl. Acad. 
5 ScL USA 79:4927-4931 (1982); Cone & Mulligan, Proc. Natl Acad. ScL USA, 81:6349- 
6353 (1984); Mann etal, Cell, 33:153-159(1993); Pear et dL % Proc. Natl Acad. ScL 
USA, 90:8392-8396 (1993); Kitamura et al., Proc. NatL Acad. ScL USA, 92:9146-9150 
(1995); Kinsella et al, Human Gene Therapy, 7:1405-1413 (1996); Hofmann et al., Proc. 
Natl Acad. ScL USA, 93:5185-5190 (1996); Choate et al., Human Gene Therapy, 7:2247 

10 (1996); WO 94/19478; Hawley et al. 9 Gene Therapy, 1:136 (1994) and Rivere et al., 
Genetics, 92:6733 (1995), all of which are incorporated by reference. 

Generally, to construct a viral vector, a chimeric gene according to the present 
invention can be operably linked to a suitable promoter. The promoter-chimeric gene 
construct is then inserted into a non-essential region of the viral vector, typically a 

15 modified viral genome. This results in a viable recombinant virus capable of expressing 
the fusion protein encoded by the chimeric gene in infected host cells. Once in die host 
ceil, the recombinant virus typically is integrated into the genome of the host cell. 
However, recombinant bovine papilloma viruses typically replicate and remain as 
extrachromosomal elements. 

20 In another embodiment, the detection assays of the present invention are 

conducted in plant cell systems. Methods for expressing exogenous proteins in plant 
cells are well known in the art See generally, Weissbach & Weissbach, Methods for 
Plant Molecular Biology, Academic Press, NY, 1988; Grierson & Corey, Plant 
Molecular Biology, 2d Ed, Blackie, London, 1988. Recombinant virus expression 

25 vectors based on, e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic virus (TMV) 
can all be used Alternatively, recombinant plasmid expression vectors such as Ti 
plasmid vectors and Ri plasmid vectors are also useful. The chimeric genes encoding the 
fusion proteins of the present invention can be conveniently cloned into the expression 
vectors and placed under control of a viral promoter such as the 35S RNA and 19S RNA 

30 promoters of CaMV or the coat protein promoter of TMV, or of a plant promoter, e.g., 
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the promoter of the small subunit of RUBISCO and heat shock promoters (e.g., soybean 
hspl7.5-E or hspl7.3-B promoters). 

In addition, the in vivo assay of the present invention can also be conducted in 
insect cells, e.g., Spodopterafrugiperda cells, using a baculovirus expression system. 
5( Expression vectors and host cells useful in this system are well known in the art and are 
generally available from various commercial vendors. For example, the chimeric genes 
of the present invention can be conveniently cloned into a non-essential region (e.g., the 
polyhedrin gene) of an Autographa californica nuclear polyhidrosis virus (AcNPV) 
vector and placed under control of an AcNPV promoter (e.g., the polyhedrin promoter). 

10 Hie non-occluded recombinant viruses thus generated can be used to infect host cells 

such as Spodopterafrugiperda cells in which the chimeric genes are expressed. See U.S. 
Patent No. 4,215,051. 

lii a preferred embodiment of die present invention, die fusion proteins are 
expressed in a yeast expression system using yeasts such as Saccharomyces cerevisiae, 

15 Hansenula polymorphs Pichia pastoris, and Schizosaccharomyces pombe as host cells. 
Hie expression of recombinant proteins in yeasts is a well-developed field, and the 
techniques useful in this respect are disclosed in detail in The Molecular Biology of the 
Yeast Saccharomyces, Eds. Strathern et al. y Vols. I and H, Cold Spring Harbor Press, 
1982; Ausubel et aL, Current Protocols in Molecular Biology, New York, Wiley, 1994; 

20 and Guthrie and Fink, Guide to Yeast Genetics and Molecular Biology, in Methods in 

Enzymology, Vol. 194, 1991, all of which are incorporated herein by reference. Sudbery, 
Curr. Opiru Biotech., 7:517-524 (1996) reviews the success in die art in expressing 
recombinant proteins in various yeast species; the entire content and references cited 
therein are incorporated herein by reference. In addition, B artel and Fields, eds M The 

25 Yeast Two-Hybrid System, Oxford University Press, New York, NY, 1997 contains 
extensive discussions of recombinant expression of fusion proteins in yeasts in the 
context of various yeast two-hybrid systems, and cites numerous relevant references: 
These and other methods known in the art can all be used for purposes of the present 
invention. Hie application of such methods to the present invention should be apparent 

30 to a skilled artisan apprised of the present disclosure. 
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Generally, each of the two chimeric genes is included in a separate expression 
vector (bait vector and prey vector). Both vectors can be co-transformed into a single 
yeast host cell. As will be apparent to a skilled artisan, it is also possible to express both 
chimeric genes from a single vector. In a preferred embodiment; the bait vector and prey 
5 vector are introduced into two haploid yeast cells of opposite mating types, e.g., a-type 
and a-type, respectively. The two haploid cells can be mated at a desired time to form a 
diploid cell expressing both chimeric genes. 

Generally, die bait and prey vectors for recombinant expression in yeast include a 
yeast replication origin such as the 2\i origin or (he ARSH4 sequence for the replication 

10 and maintenance of the vectors in yeast cells. Preferably, the vectors also have a bacteria 
origin of replication (e.g., ColEl) and a bacteria selection marker (e.g., amp R marker, i.e., 
bUz gene). Optionally, the CEN6 centromeric sequence is included to control the 
replication of the vectors in yeast cells. Any constitutive or inducible promotes capable 
of driving gene transcription in yeast cells may be employed to control the expression of 

IS the chimeric genes. Such promoters are operably linked to the chimeric genes. Examples 
of suitable constitutive promoters include but are not limited to the yeast ADH1 , PGK1 f 
TEF2 , GPD1 , HIS3> and CYC1 promoters. Example of suitable inducible promoters 
include but are not limited to the yeast GAU (inducible by galactose), CUP1 (inducible 
by Cu**), and FQS1 (inducible by pheromone) promoters; the AOX/MOX promoter from 

20 H. polymorpha and P. Pastoris (repressed by glucose or ethanol and induced by 

methanol); chimeric promoters such as those that contain LexA operators (inducible by 
LexA-containing transcription factors); and the like. Inducible promoters are preferred 
when the fusion proteins encoded by the chimeric genes are toxic to the host cells, if it is 
desirable, certain transcription repressing sequences such as the upstream repressing 

25 sequence (URS) from SP013 promoter can be operably linked to the promoter sequence, 
e.g., to the 5* end of the promoter region. Such upstream repressing sequences function 
to fine-tune the expression level of the chimeric genes. 

Preferably, a transcriptional termination signal is operably linked to the chimeric 
genes in the vectors. Generally, transcriptional termination signal sequences derived 

30 from, e.g., the CYC1 and ADH1 genes can be used. 
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Additionally, it is preferred that the bait vector and prey vector contain one or 
more selectable markers for the selection and maintenance of only those yeast cells that 
harbor a chimeric gene. Any selectable markers known in the art can be used for 
purposes of this invention so long as yeast cells expressing the chimeric gene(s) can be 
5 positively identified or negatively selected. Examples of markers that can be positively 
identified are those based on color assays, including the lacZ gene which encodes 0- 
galactosidase, the firefly lucif erase gene, secreted alkaline phosphatase, horseradish 
peroxidase, the blue fluorescent protein (BFP), and the green fluorescent protein (GFP) 
gene (see Cubitt et aU Trends BiocJiem. Sci., 20:448-455 (1995)). Other markers 

10 emitting fluorescence, chemUuminescence, UV absorption, infrared radiation, and the 
like can also be used. Among the markers that can be selected are auxotrophic markers 
including, but not limited to, URA3, H2S3, TKP1, LEU2, LYS2, ADE2> and the like. 
Typically, for purposes of auxotrophic selection, the yeast host cells transformed with 
bait vector and/or prey vector are cultured in a medium lacking a particular nutrient. 

15 Other selectable markers are not based on auxotrophies, but rather on resistance or 

sensitivity to an antibiotic or other xenobiotic. Examples of such markers include but are 
not limited to chloramphenicol acetyl transferase (CAT) gene, which confers resistance to 
chloramphenicol; CAN1 gene, which encodes an arginine permease and thereby renders 
cells sensitive to canavanine (see SikorsM etaL, Meth Enzymol, 194:302-318 (1991)); 

20 the bacterial kanamycin resistance gene (kan R ), which renders eucaryotic cells resistant to 
the aminoglycoside G418 (see Wach etaL, Yeast, 10:1793-1808 (1994)); and CYH2 
gene, which confers sensitivity to cycloheximide (see Sikorski et aL p Metfu Enzymol, 
194:302-318 (1991)). In addition, the CUP1 gene, which encodes metallothionein and 
thereby confers resistance to copper, is also a suitable selection marker. Each of the 

25 above selection markers may be used alone or in combination. One or more selection 
markers can be included in a particular bait or prey vector. The bait vector and prey 
vector may have the same or different selection markers. In addition, the selection 
pressure can be placed on the transformed host cells either before or after mating the 
haploid yeast cells. 

30 As will be apparent, the selection markers used should complement the host 

strains in which the bait and/or prey vectors are expressed. In other words, when a gene 
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is used as a selection marker gene, a yeast strain lacking the selection marker gene (or 
having mutation in the corresponding gene) should be used as host cells. Numerous yeast 
strains or derivative strains corresponding to various selection markers are known in the 
art Many of them have been developed specifically for certain yeast two-hybrid 
5 systems* The application and optional modification of such strains with respect to the 
present invention should be apparent to a skilled artisan apprised of the present 
disclosure. Methods for genetically manipulating yeast strains using genetic crossing or 
recombinant mutagenesis are well known in the art. See e.g., Rothstein, MetK EnzymoL, 
101:202-211 (1983). By way of example, the following yeast strains are well known in 
10 the art, and can be used in the present invention upon necessary modifications and 
adjustment: 

L40 strain which has the genotype MATa his3A200 trpl-901 leu2-3,112 ade2 
LYS2::(lexAop)4-HIS3 URA3::(lexAop)8-lacZ\ 

BGY48 strain which has the genotype MATa trpl his3 ura3 6ops-LEU2; and 
15 MaV103 strain which has the genotype MATa ura3-52 leu2-3 9 112 trpl-901 

his3A200 ade2-101 gcd4A gal80A SPAL10:;URA3 GALl::HIS3::lys2 {see Kumar et al $ 
J. Biol Chenu 272:13548-13554 (1997); Vidal et dL, Proc Natl. Acad. ScL USA, 
93:10315-10320 (1996)). Such strains are generally available in the research community, 
and can also be obtained by simple yeast genetic manipulation. See, e.g., The Yeast Two- 
20 Hybrid System, Battel and Reids, eds., pages 173-182, Oxford University Press, New 
York, NY, 1997. 

In addition, the following yeast strains are commercially available: 
Y190 strain which is available from Clontech, Palo Alto, California and has the 
genotype MATa gal4 gal80 his3A200 trpl-901 ade2-101 ura3-52 leu2-3, 112 
25 VRA3::GAL14acZ LYS2::GAL1-H1S3 cyh T \ and 

YRO-2 Strain which is available from Stratagene, La Jolla, California and has the 
genotype MATa ura3-52 his3-200 ade2-101 tys2-801 trpl-901 leu2-3, 112 gal4-542 
gal80-538 LYS2::GAU-HIS3 URA3::GAU/CYCl-lacZ> 

In fact, different versions of vectors and host strains specially designed for yeast 
30 two-hybrid system analysis are available in kits from commercial vendors such as 
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Clontech, Palo Alto, California and Stratagene, La Jolla, California, all of which can be 
modified for use in the present invention. 

4.3.1.2. Reporters 

5 Generally, in a transcription-based two-hybrid assay, the interaction between a 

bait fusion protein and a prey fusion protein brings the DNA-binding domain and the 
transcription-activation domain into proximity forming a functional transcriptional factor, 
winch acts on a specific promoter to drive the expression of a reporter protein. The 
transcription activation domain and the DNA-binding domain may be selected from 

10 various known transcriptional activators, e.g., GAL4, GCN4, ARD1, the human estrogen 
receptor, B. coli LexA protein, herpes simplex virus VP16 (Triezenberg et aL, Genes 
Dev. 2:718-729 (1988)), the E. coli B42 protein (acid blob, see Gyuris et aL % Cell, 
75:791-803 (1993)), NF-kB p65, and the like- Hie reporter gene and the promoter 
driving its transcription typically are incorporated into a separate reporter vector. 

15 Alternatively, the host cells are engineered to contain such a promoter-reporter gene 

sequence in their chromosomes. Thus, the interaction or lack of interaction between two 
interacting protein members of a protein complex can be determined by detecting or 
measuring changes in the reporter in the assay system. Although the reporters and 
selection markers can be of similar types and used in a similar manner in the present 

20 invention, the reporters and selection markers should be carefully selected in a particular 
detection assay such that they are distinguishable from each other and do not interfere 
with each other's function. 

Many different types reporters are useful in the screening assays. For example, a 
reporter protein may be a fusion protein having an epitope tag fused to a protein. 

25 Commonly used and commercially available epitope tags include sequences derived 
from, e.g., influenza virus hemagglutinin (HA), Simian Virus 5 (V5), polyhistidine 
(6xHis), c-myc, lacZ, GST, and the like. Antibodies specific to these epitope tags are 
generally commercially available. Thus, the expressed reporter can be detected using an 
epitope-specific antibody in an immunoassay. 

30 In another embodiment, the reporter is selected such that it can be detected by a 

color-based assay. Examples of such reporters include, e.g., the lacZ protein (P~ 
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galactosidase), the green fluorescent protein (GFP), which can be detected by 
fluorescence assay and sorted by flow-activated cell sorting (FACS) {See Cubitt et aL, 
Trends Biochenu ScL, 20:448-455 (1995)), secreted alkaline phosphatase, horseradish 
peroxidase, the blue fluorescent protein (BFP), and lucif erase photoproteins such as 
5 aequorin, obelin, mnemiopsin, and berovin (See U.S. Patent No. 6,087,476, which is 
incorporated herein by reference). 

Alternatively, an auxotrophic factor is used as a reporter in a host strain deficient 
in the auxotrophic factor. Thus, suitable auxotrophic reporter genes include, but are not 
limited to, URA3, HIS3 % TRP1, LEU2, LYS2, ADE2, and the like. For example, yeast 

10 cells containing a mutant URA3 gene can be used as host cells (Ura~ phenotype). Such 
cells lack -encoded functional orotidine-5 ' -phosphate decarboxylase, an enzyme 
required by yeast cells for the biosynthesis of uracil. As a result, the cells are unable to 
grow on a medium lacking uracil. However, wild-type orotidine-5 ' -phsphate 
decarboxylase catalyzes the conversion of a non-toxic compound 5-fluoroorotic acid (5- 

15 FO A) to a toxic product, 5-fLuorouracil. Thus, yeast cells containing a wild-type URA3 
gene are sensitive to 5-FOA and cannot grow on a medium containing S-FOA. 
Therefore, when the interaction between the interacting protein members in the fusion 
proteins results in the expression of active orotidine-5' -phosphate decarboxylase, the Ura" 
(F6a R ) yeast cells will be able to grow on a uracil deficient medium (SC-Ura plates). 

20 However, such cells will not survive on a medium containing 5-FOA. Thus, protein- 
protein interactions can be detected based on cell growth. 

Additionally, antibiotic resistance reporters can also be employed in a similar 
manner. In this respect, host cells sensitive to a particular antibiotics is used. Antibiotics 
resistance reporters include, for example, chloramphenicol acetyl transferase (CAT) gene 

25 and the kan R gene, which confers resistance to G4 18 in eukaryotes and to kanamycin in 
prokaryotes. 

43.1.3. Screening Assays for Interaction Antagonists 

The screening assay of the present invention is useful in selecting compounds 
30 capable of interfering with or disrupting or dissociating protein-protein interaction in the 
protein complexes of the present invention, i.e., interactions between TsglOl or a 
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homologue or derivative thereof and HIV GAG, HIV GAGp6, other retroviral GAG 
containing a P(T/S)AP late domain motif, or a homologue or derivative thereof. For 
example, TsglOl and HTV GAG play a role in HIV propagation, and thus are involved in 
HTV infection and AIDS. It may be possible to ameliorate or alleviate the diseases or 
disorders in a patient by interfering with or dissociating normal interactions between 
TsglOl and HIV GAG. Alternatively, if the disease or disorder is associated with 
increased expression of TsglOl and/or HIV GAG in accordance with the present 
invention, then the disease may be treated or prevented by weakening or dissociating the 
interaction between TsglOl and HIV GAG in a patient In addition, if a disease or 
disorder is associated with mutant forms of TsglOl and/or HIV GAG that lead to 
strengthened protein-protein interaction therebetween, then the disease or disorder may 
be treated with a compound that weakens or interferes with the interaction between the 
mutant forms of TsglOl and HIV GAG. 

In a screening assay for interaction antagonists, TsglOl and HIV GAG or 
GAGp6, for example, are used as test proteins expressed in the form of fusion proteins as 
described above for purposes of a two-hybrid assay. The fusion proteins are expressed in 
a host cell and allowed to interact with each other in the presence of one or more test 
compounds. 

In a preferred embodiment, a counterselectable marker is used as a reporter such 
that a detectable signal (e.g., appearance of color or fluorescence, or cell survival) is 
present only when the test compound is capable of interfering with the interaction 
between the two test proteins. In this respect, the reporters used in various '^re verse two- 
hybrid systems* 1 known in the art may be employed Reverse two-hybrid systems are 
disclosed in, e.g., U.S. Patent Nos. 5,525,490; 5,733,726; 5,885,779; Vidal et aL, Proc. 
Natl Acad. Set USA, 93:10315-10320 (1996); and Vidal et aL, Proc. Natl. Acad. ScL 
USA. 93: 10321-10326 (1996), all of which are incorporated herein by reference. 

Examples of suitable counterselectable reporters useful in a yeast system include 
the URA3 gene (encoding orotidine-5' -decarboxylase, which converts 5-fluroorotic acid 
(5-FOA) to the toxic metabolite 5-fluorouracil), the CAN1 gene (encoding arginine 
permease, which transports toxic arginine analog canavanine into yeast cells), the GAL1 
gene (encoding galactokinase, which catalyzes the conversion of 2-deoxygalactose to 
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toxic 2-deoxygalactose-l -phosphate), the L7S2 gene (encoding a-aminoadipate 
reductase, which renders yeast cells unable to grow on a medium containing a- 
aminoadipate as the sole nitrogen source), the METIS gene (encoding O 
acetylhomoserine sulfhydrylase, which confers on yeast cells sensitivity to methyl 
5 mercury), and die CYH2 gene (encoding L29 ribosomal protein, which confers sensitivity 
to cycloheximide). In addition, any known cytotoxic agents including cytotoxic proteins 
such as the diphtheria toxin (DTA) catalytic domain can also be used as counterselectable 
reporters. See U.S. Patent No. 5,733,726. DTA causes the ADP-ribosyiation of 
elongation factor-2 and thus inhibits protein synthesis and causes cell death. Other 
10 examples of cytotoxic agents include recin, Shiga toxin, and exotoxin A of Pseudomonas 
aeruginosa. 

For example, when the URA3 gene is used as a counterselectable reporter gene, 
yeast cells containing a mutant URA3 gene can be used as host cells (Ura" Foa R 
phenotype) for the in vivo assay. Such cells lack Z7RA3-encoded functional orotidine-5*- 

15 phsphate decarboxylase, an enzyme required for the biosynthesis of uracil. As a result, 
the cells are unable to grow on media lacking uracil. However, because of die absence of 
a wild-type orotidine-5 ' -phsphate decarboxylase, the yeast cells cannot convert non-toxic 
5-fluoroorotic acid (5-FOA) to a toxic product, 5-fluorouracil. Thus, such yeast cells are 
resistant to 5-FOA and can grow on a medium containing 5-FOA. Therefore, for 

20 example, to screen for a compound capable of disrupting interaction between TsglOl and 
HIV GAGp6, TsglOl can be expressed as a fusion protein with a DNA-binding domain 
of a suitable transcription activator while HIV GAGp6 is expressed as a fusion protein 
with a transcription activation domain of a suitable transcription activator. In die host 
strain, the reporter URA3 gene may be operably linked to a promoter specifically 

25 responsive to the association of the transcription activation domain and the DNA-binding 
domain. After the fusion proteins are expressed in the Ura" Foa R yeast cells, an in vivo 
screening assay can be conducted in the presence of a test compound with die yeast cells 
being cultured on a medium containing uracil and 5-FOA. If the test compound does not 
disrupt the interaction between TsglOl and HIV GAG oar GAGp6, active URA3 gene 

30 product, i.e., orotidme-5 9 -decarboxylase, which converts 5-FOA to toxic 5-fluorouracil, 
is expressed. As a result, the yeast cells cannot grow. On the other hand, when the test 
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compound disrupts the interaction between TsglOl and HIV OAO or GAGp6, no active 
orotidine-5 ' -decarboxylase is produced in the host yeast cells. Consequently, the yeast 
cells will survive and grow on the 5-FOA-containing medium. Therefore, compounds 
capable of interfering with or dissociating the interaction between TsglOl and HTV OAO 
5 or GAGp6 can thus be identified based on colony formation. 

As will be apparent, the screening assay of the present invention can be applied in 
a format appropriate for large-scale screening. For example, combinatorial technologies 
can be employed to construct combinatorial libraries of small organic molecules or small 
peptides. See generally, e.g., Kenan et at., Trends Biochem. Sc., 19:57-64 (1994); Gallop 

10 etaL, J. Med. Chem., 37:1233-1251 (1994); Gordon et aL, J. Med Chem., 37:1385-1401 
(1994); Ecker et aL, Biotechnology, 13:351-360 (1995). Such combinatorial libraries of 
compounds can be applied to the screening assay of the present invention to isolate 
specific modulators of particular protein-protein interactions. In the case of random 
peptide libraries, the random peptides can be co-expressed with the fusion proteins of the 

1 5 present invention in host cells and assayed in vivo, See e.g., Yang et al., Nucl. Acids Res., 
23:1152-1 156 (1995). Alternatively, they can be added to the culture medium for uptake 
by the host cells. 

Conveniently, yeast mating is used in an in vivo screening assay. For example, 
haploid cells of a-mating type expressing one fusion protein as described above is mated 

20 with haploid cells of a-mating type expressing the other fusion protein. Upon mating, the 
diploid cells are spread on a suitable medium to form a lawn. Drops of test compounds 
can be deposited onto different areas of the lawn. After culturing the lawn for an 
appropriate period of. time, drops containing a compound capable of modulating the 
interaction between the particular test proteins in the fusion proteins can be identified by 

25 stimulation or inhibition of growth in the vicinity of the drops. 

The screening assays of the present invention for selecting compounds capable of 
modulating protein-protein interactions can also be fine-tuned by various techniques to 
adjust the thresholds or sensitivity of the positive and negative selections. Mutations can 
be introduced into the reporter proteins to adjust their activities. The uptake of test 

30 compounds by the host cells can also be adjusted. Pot example, yeast high uptake 

mutants such as the erg6 mutant strains can facilitate yeast uptake of die test compounds. 
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See Oaber et al„ Mol Cell. Biol, 9:3447-3456 (1989). likewise, the uptake of the 
selection compounds such as 5-FOA, 2-deoxy galactose, cycloheximide, a-aminoadipate, 
and the like can also be fine-tuned. 

5 4.4. Virtual Screen and Compound Optimization 

Once the test compounds are selected capable of modulating the protein-protein 
interaction between the interacting partners of a protein complex of the present invention, 
a data set including data defining the identity or characteristics of the test compounds can 
be generated. The data set may include information relating to the properties of a 

10 selected test compound, e.g., chemical structure, chirality, molecular weight, melting 

point, etc. Alternatively, the data set may simply include assigned identification numbers 
understood by the researchers conducting the screening assay and/or researchers 
receiving the data set as representing specific test compounds. The data or information 
can be cast in a transmittable form that can be communicated or transmitted to other 

15 researchers, particularly researchers in a different country. Such a transmittable form can 
vary and can be tangible or intangible. For example, the data set defining one or more 
selected test compounds can be embodied in texts, tables, diagrams, molecular structures, 
photographs, charts, images or any other visual forms. The data or information can be 
recorded on a tangible media such as paper or embodied in computer-readable forms 

20 (e.g., electronic, electromagnetic, optical or other signals). The data in a computer- 
readable form can be stored in a computer usable storage medium (e.g., floppy disks, 
magnetic tapes, optical disks, and the like) or transmitted directly through a 
communication infrastructure. In particular, the data embodied in electronic signals can 
be transmitted in the form of email or posted on a website on the Internet or Intranet In 

25 addition, the information or data on a selected test compound can also be recorded in an 
audio form and transmitted through any suitable media, e.g., analog or digital cable lines, 
fibet optic cables, etc., via telephone, facsimile, wireless mobile phone, Internet phone 
and the like. 

Thus, the information and data on a test compound selected in a screening assay 
30 described above or by virtual screening as discussed below can be produced anywhere in 
the world and tra n s m itted to a different location. For example, when a screening assay is 
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conducted offshore, the information and data on a selected test compound can be 

generated and cast in a transmittable form as described above. The data and information 

in a transmittable form thus can be imported into the U.S. or transmitted to any other 

countries, where the data and information may be used in further testing the selected test 
S t compound and/or in modifying and optimizing the selected test compound to develop 

lead compounds for testing in clinical trials. 

Compounds can also be selected based on structural models of the target protein 

or protein complex and/or test compounds, e.g., by virtual screen. In addition, once an 

effective compound is identified, structural analogs or mimetic s thereof can be produced 
10 based on rational drug design with the aim of improving drug efficacy and stability, and 

reducing side effects. Methods known in the art for virtual screen and rational drag 

design can be used in the present invention. See, e.g., Hodgson et aL, Bio/Technology, 

9:19-21 (1991); U.S. Patent Nos. 5,800,998 and 5,891,628, all of which are incorporated 

herein by reference. An example of rational drug design is the development of HIV 
15 protease inhibitors. See Erickson et al $ Science, 249:527-533 (1990). 

In this respect, structural information on the target protein or protein complex is 

obtained. Preferably, atomic coordinates defining a three-dimensional structure of the v 

target protein or protein complex are obtained. For example, each of the interacting pair 

can be expressed and purified. The purified interacting protein pairs are then allowed to 
20 interact with each other in vitro under appropriate conditions. Optionally, the interacting 

protein complex can be stabilized by crosslinking or other techniques. Hie interacting 

complex can be studied using various biophysical techniques including, e.g., X-ray 

crystallography, NMR, computer modeling, mass spectrometry, and the like. Methods 

for obtaining such atomic coordinates by X-ray crystallography, NMR, and the like are 
25 known in the art and the application thereof to the target protein or protein complex of the 

present invention should be apparent to skilled persons in die art of structural biology. 

See Smyth and Martin, Mol Pathol, 53:8-14 (2000); Oakley and Wee, Clin. Exp. 

Pharmacol Physiol, 27(3): 145-151 (2000); Ferentz and Wagner, Q. Rev. Biophys., 

33:29-65 (2000); and Roberts, Curr. Opin. Biotecknol, 10:42-47 (1999). 
30 In addition, understanding of the interaction between the proteins of interest in the 

presence or absence of a modulator compound can also be derived from mutagenesis 
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analysis using yeast two-hybrid system or other methods for detecting protein-protein 
interaction. In this respect, various mutations can be introduced into the interacting 
proteins and the effect of die mutations on protein-protein interaction is examined by a 
suitable method such as die yeast two-hybrid system. 
5 Various mutations including amino acid substitutions, deletions and insertions can 

be introduced into a protein sequence using conventional recombinant DNA 
technologies. Generally, it is particularly desirable to decipher the binding sites. Thus, it 
is important that the mutations introduced only affect protein-protein interaction or 
protein-compound interaction and cause minimal structural disturbances. Mutations are 

10 preferably designed based on knowledge of the three-dimensional structure of the 

interacting proteins. Preferably, mutations are introduced to alter charged amino acids or 
hydrophobic amino acids exposed on the surface of the proteins, since ionic interactions 
and hydrophobic interactions are often involved in protein-protein interactions. 
Alternatively, the "alanine scanning mutagenesis" technique is used. See Wells, et al., 

15 Methods EnzymoL, 202:301-306 (1991); Bass et aL, Proc. Natl Acad, Set USA, 88:4498- 
4502 (1991); Bennet et al, J. Biol Chenu, 266:5191-5201 (1991); Diamond etal, J. 
Virol, 68:863-876 (1994). Using this technique, charged or hydrophobic amino acid 
residues of the interacting proteins axe replaced by alanine, and the effect on the 
interaction between the proteins is analyzed using e.g., the yeast two-hybrid system. For 

20 example, the entire protein sequence can be scanned in a window of five amino acids. 
When two or more charged or hydrophobic amino acids appear in a window, the charged 
or hydrophobic amino acids are changed to alanine using standard recombinant DNA 
techniques. The thus mutated proteins are used as "test proteins" in the above-described 
two-hybrid assay to examine the effect of the mutations on protein-protein interaction. 

25 Preferably, the mutagenesis analysis is conducted both in the presence and in the absence 
of an identified modulator compound. In this manner, the d omains or residues of the 
proteins important to protein-protein interaction and/or the interaction between the 
modulator compound and the proteins can be identified. likewise, interactions between a 
selected compound and a target protein (e,g., TsglOl) can also be studied by mutagenesis 

30 of the target protein. 
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Based on the structural information obtained, structural relationships between the 
interacting proteins, between a selected compound and the interacting proteins, or 
between a selected compound and a target protein are elucidated The moieties and the 
three-dimensional structure of the selected compound critical to its modulating effect on 
5 the interaction of the proteins of interest or on a target protein are revealed. Medicinal 
chemists can then design analog compounds having similar moieties and structures. 

In addition, an identified peptide compound capable of modulating a particular 
protein-protein interaction or a particular target protein can also be analyzed by the 
alanine scanning technique and/or a screening assay to determine the domains or residues 

10 of the peptide important to its modulating effect on a particular protein-protein interaction 
or a particular target protein. The peptide compound can be used as a lead molecule for 
rational design of small organic molecules or peptide mimetics. See Huber et aL, Curr. 
Med. Chem., 1:13-34 (1994). 

The residues or domains critical to the modulating effect of the identified 

15 compound constitute the active region of the compound known as its ^pharmacophore." 
Once the pharmacophore has been elucidated, a structural model can be established by a 
modeling process that may incorporate data from NMR analysis, X-ray diffraction data, 
alanine scanning, spectroscopic techniques and the like. Various techniques including 
computational analysis, similarity mapping and the like can all be used in this modeling 

20 process. See e.g., Perry et aL % in OSAR: Quantitative Structure-Activity Relationships in 
Drug Design, pp.189-193, Alan R. Uss, Inc., 1989; Rotivinen et aL, Acta 
Pharmaceutical Fennica, 97:159-166 (1988); Lewis et at, Proc. R Soc. LoruL, 236:125- 
140 (1989); McKinaly etaL 7 Annu. Rev. Pharmacol Taxiciol, 29:111-122 (1989). 
Commercial molecular modeling systems available from Polygen Corporation, Waltham, 

25 MA, include the CHARMm program, which performs the energy minimization and 

molecular dynamics functions, and QUANTA program which performs the construction, 
graphic modeling and analysis of molecular structure. Such programs allow interactive 
construction, visualization and modification of molecules. Other computer modeling 
programs are also available from BioDesign, Inc. (Pasadena, CA.), Hypercube, Inc. 

30 (Cambridge, Ontario), and Allelix, Inc. (Mississauga, Ontario, Canada). 
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A template can be fanned based on the established model. Various compounds 
can then be designed by linking various chemical groups or moieties to the template. 
Various moieties of the template can also be replaced. In addition, in the case of a 
peptide lead compound, the peptide or mimetics thereof can be cyclized, e~g., by linking 
5 the N-terminus and C-terminus together, to increase its stability. These rationally 
designed compounds are further tested. In this manner, pharmacologically acceptable 
and stable compounds with improved efficacy and reduced side effect can be developed. 
Hie compounds identified in accordance with the present invention can be incorporated 
into a pharmaceutical formulation suitable for administration to an individual. 

10 In addition, the structural models or atomic coordinates defining a three- 

dimensional structure of the target protein or protein complex can also be used in virtual 
screen to select compounds capable of modulating the target protein or protein complex. 
Various methods of computer-based virtual screen using atomic coordinates are generally 
known in the ait. For example, U.S. Patent No. 5,798,247 (which is incorporated herein 

IS by reference) discloses a method of identifying a compound (specifically, an interleulrin 
converting enzyme inhibitor) by determining binding interactions between an organic 
compound and binding sites of a binding cavity within the target protein. The binding 
sites are defined by atomic coordinates. 

Thus, as will be apparent to a skilled artisan, atomic coordinates defining a three- 

20 dim en sional structure of a target protein or protein complex of the present invention can 
be provided by any methods known in the art Compounds can then be designed or 
selected based on the atomic coordinates. 

5. Therapeutic Applications 

25 In accordance with another aspect of the present invention, methods are provided 

for modulating a protein complex comprising TsglOl and HIV GAG or GAGp6 in 
human cells. Hie human cells can be in in vitro cell or tissue cultures. The methods are 
also applicable to human cells in a patient. 

In one embodiment, the concentration of a protein complex having TsglOl 

30 interacting with HIV GAG or GAGp6 is reduced in the cells. Various methods can be 
employed to reduce the concentration of the protein complex. The protein complex 
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concentration can be reduced by interfering with the interactions between TsglOl and 
HIV GAG or OAGp6. Far example, compounds capable of interfering with interactions 
between TsglOl and HIV GAG or GAGp6 can be administered to the cells in vitro or in 
vivo in a patient. Such compounds can be compounds capable of binding TsglOl 
5 protein, particularly the UEV domain of TsglOl protein, or HIV GAG or GAGp6. They 
can also be antibodies immunoreactive with the TsglOl protein or HIV GAG or GAGp6. 
Preferably, antibodies that bind to the UEV domain of the TsglOl protean are used. Also, 
the compounds can be small peptides derived from the HIV GAG or GAGp6 protein or 
mimetics thereof capable of binding TsglOl, or small peptides derived from TsglOl 
10 protein or mimetics thereof capable of binding HIV GAG oar GAGp6. 

In another embodiment, the method of modulating the protein complex includes 
inhibiting the expression of TsglOl protein and/or HTV GAG or GAGp6 protein. The 
inhibition can be at the transcriptional, translational, or post-translational level. For 
example, anti sense compounds and ribozyme compounds can be administered to human 
IS cells in cultures or in human bodies. 

In the various embodiments described above, preferably the concentrations or 
activities of both TsglOl protein and HTV GAG or GAGp6 are reduced or inhibited. 

In yet another embodiment, an antibody selectively immunoreactive with a 
protein complex having TsglOl interacting with HTV GAG or GAGp6 is administered to 
20 cells in vitro or in human bodies to inhibit the protein complex activities and/or reduce 
the concentration of the protein complex in the cells or patient. 

Hie methods for modulating the protein complex comprising TsglOl and HIV 
GAG or GAGp6 as provided according to the present invention can be used to inhibit 
HIV viral budding from infected host cells. When multiple cells are present, e.g., in cell 
25 culture or in a patient's body, the inhibition of viral budding prevents the viruses from 
being released from the infected host cells thereby suppressing further viral propagation. 
Accordingly, the present invention also encompasses methods of treating HIV infection 
and preventing AIDS in patients by reducing the concentration or inhibiting the activities 
of protein complexes having TsglOl and HIV GAG or GAGp6, or by reducing the 
30 concentration or inhibiting the activities of TsglOl or of HIV GAG or GAGp6. 
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In addition* the methods for modulating the protein complex comprising TsglOl 
and HTV GAG or GAGp6 can also be useful in inhibiting the budding of many other 
viruses, particularly those viruses whose budding from infected host cells is dependent on 
the late domain motif P(T/S)AP. As described above, the P(T/S)AP motif, which is 
5 responsible for HIV GAGpffs interaction with TsglOl, is conserved among the GAGp6 
domains of all known primate lend viruses. In non-primate lentiviruses, which lack a 
GAGp6 domain, the P(T/S)AP motif is at the immediate C terminus of the GAG 
polyprotein. In addition, many other retroviruses also contain the P(T/S)AP late domain 
motif in their GAG polypeptides, which are belived to also interact with TsglOl in the 

10 same manner as the HIV GAGp6 protein. 

Accordingly, the present invention also provides methods for modulating a 
protein complex containing TsglOl protein and a human retrovirus (other than HTV) 
GAG polypeptide or a fragment thereof containing a P(T/S)AP late domain motif. Such 
modulating methods can be used in inhibiting the budding of such a virus from its 

IS infected host cells and in treating infections by such a virus in patients. 

In addition, the present invention also encompasses methods for modulating a 
protein complex containing a TsglOl protein ortholog and a non-human retrovirus GAG 
polypeptide or a fragment thereof containing a P(T/S)AP late domain motif. Such 
modulating methods can be used in inhibiting the budding of such non-human 

20 retroviruses from their infected animal host cells and in treating infections by such 
viruses in animals. 

Examples of such non-human retroviruses include, but are not limited to, primate 
lentiviruses other than HIV and non-primate lentiviiuses (except for EIAV). As is known 
in the art, lentiviruses are a group of retroviruses capable of long-term latent infection of 

25 vertebrate cells- They replicate in host cells only when activated. Lentiviruses typically 
*foave enveloped virions. Non-primate lentiviruses include bovine lentiviruses (e.g. 
bovine immunodeficiency virus (BIV), Jembrana disease virus), feline lentiviruses (e.g. 
feline immunodeficiency virus (FIV) which causes immunodeficiency, wasting, and 
encephalitis in cats), ovine/caprine lenti virus (e.g. caprine arthritis-encephalitis virus 

30 (CAEV) which causes anemia and wasting in goats, ovine lentivirus, Visna virus which 
causes pneumonia, wasting, encephalitis and arthritis), and Equine lentiviruses (e.g. 
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Equine infectious anemia vims (EIAV), which infects horses causing arthritis and 
encephalitis). Examples of non-human primate lenti viruses include various simian 
immunodeficiency viruses that infect hosts such as chimpanzee, mangabey, African 
Green monkey, mandrill, LHoest, Sykes 1 monkey, or Guereza Colobus monkey. 
5 The methods for modulating a protein complex containing TsglOl protein and a 

human retrovirus GAG polypeptide or a fragment thereof containing a P(T/S)AP late 
domain motif and the methods for modulating a protein complex containing a TsglOl 
protein ortholog and a non-human retrovirus GAG polypeptide or a fragment thereof 
containing a P(T/S)AP late domain motif should be performed in a manner similar to 

10 those described above in the context of modulating a protein complex of TsglOl and HIV 
GAG or GAGp6. Likewise, the methods for treating infection by such other human or 
non-human viruses should be similar to the methods for treating HIV infection, as will be 
apparent to skilled artisans apprised of the present disclosure. 

Specifically, the concentration of the protein complexes can be reduced in the 

IS cells by various methods. For example, the protein complex concentration can be 
reduced by interfering with the interactions between TsglOl protein and a human 
retrovirus GAG polypeptide or a fragment thereof containing a P(T/S)AP late domain 
motif, or the interactions between a TsglOl protein ortholog and non-human retroviruses 
GAG polypeptide or a fragment thereof containing a P(T/S)AP late domain motif. 

20 Compounds capable of interfering with the interactions can be administered to cells in 
vitro or in vivo in a subject to be treated. Such compounds can be compounds capable of 
binding TsglOl protein or the TsglOl protein ortholog, particularly the UEV domain of 
TsglOl protein or the TsglOl protein ortholog. They can also be antibodies 
immunoreactive with TsglOl protein or the TsglOl protein ortholog. Preferably, 

25 antibodies that bind to the UEV domain of the TsglOl protein or the TsglOl protein 
^ ortholog are used. Also, the compounds can be small peptides derived from a retrovirus 
GAG polypeptide, preferably including the amino acid residues spanning the P(T/S)AP 
late domain motif, hi addition, antibodies selectively immunoreactive with the protein 
complexes can be administered. In other embodiments, the expression of TsglOl protein 

30 or a TsglOl ortholog is inhibited. The inhibition can be at the transcriptional, 
tran&lational, or post-translational level. For example, antisense compounds and 
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ribozyme compounds can be administered to cells in culture or in the subject to be 
treated. 

The details of the various methods for modulating the protein complexes or 
protein-protein interactions and methods for treating virus infection are described below. 
5 Although such details are described in the context of interactions between TsglOl protein 
and HIV GAG or GAGp6 and HIV infection in human cells, the analogous methods with 
respect to infections by other viruses should be apparent to skilled artisans apprised of the 
present disclosure. 

10 5.1. Antibody Therapy 

In one embodiment, an antibody may be administered to cells or tissue in vitro or 
in a patient The antibody administered may be immunoieactive with TsglOl or HIV 
GAG or GAGp6. Suitable antibodies may be monoclonal or polyclonal that fall within 
any antibody classes, e.g., IgG, IgM, IgA, etc. The antibody suitable for this invention 

IS may also take a form of various antibody fragments including, but not limited to, Fab and 
F(ab')2, single-chain fragments (scFv) ("single-chain antibodies"), and the like. In one 
embodiment, an antibody selectively immunoieactive with the protein complex formed 
from TsglOl and HIV GAG or GAGp6 in accordance with the present invention is 
administered to cells or tissue in vitro or in a patient In another embodiment, an 

20 antibody specific to TsglOl is administered to cells or tissue in vitro or in a patient 

Preferably, an antibody specific to the UEV domain of TsglOl is administered to cells or 
tissue in vitro or in a patient Methods for making the antibodies of the present invention 
should be apparent to a person of skill in the art, especially in view of the discussions in 
Section 3 above. Die antibodies can be administered in any suitable form and route as 

25 described in Section 6 below. Preferably, the antibodies are administered in a 
pharmaceutical composition together with a pharmaceutical^ acceptable carrier. 

Alternatively, the antibodies may be delivered by a gene-therapy approach. That 
is, nucleic acids encoding the antibodies, particularly single-chain fragments (scFv), may 
be introduced into cells or tissue in vitro or in a patient such that desirable antibodies may 

30 be produced by recombinant expression in vivo from the nucleic acids. For this purpose, 
the nucleic acids with appropriate transcriptional and translation regulatory sequences can 
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be directly administered into the patient Alternatively, the nucleic acids can be 
incorporated into a suitable vector as described in Sections 4 and S.S and delivered into 
cells or tissue in vitro or in a patient along with the vector. Hie expression vector 
containing the nucleic acids can be administered directly to a patient It can also be 
5 introduced into cells, preferably cells derived from a patient to be treated, and 

subsequently delivered into the patient by cell transplantation. See Section S.S below. 

5.2. Antisense Therapy 
In another embodiment antisense compounds specific to nucleic acids encoding 

10 one or more interacting protein members of a protein complex identified in the present 
invention is administered to cells or tissue in vitro or in a patient to be therapeutically or 
prophylactically treated. The antisense compounds should specifically inhibit the 
expression of the one or more interacting protein members. In preferred embodiments, 
antisense compounds specifically hybridizing to a TsglOl nucleic acid is administered. 

15 As is known in the art, antisense drugs generally act by hybridizing to a particular target 
nucleic acid thus blocking gene expression. Methods for designing antisense compounds 
and using such compounds in treating diseases are well known and well developed in the 
art For example, the antisense drug Vitravene® (fomivirsen), a 21-base long 
oligonucleotide, has been successfully developed and marketed by Isis Pharmaceuticals, 

20 Inc. for treating cytomegalovirus (CMV>induced retinitis. 

Any methods for designing and making antisense compounds may be used for 
purpose of the present invention. See generally, Sanghvi et at, eds., Antisense Reseach 
and Applications, CRC Press, Boca Raton, 1993. Typically, antisense compounds are 
oligonucleotides designed based on the nucleotide sequence of the mRNA or gene of one 

25 or more of the interacting protein members of a particular protein complex of the present 
invention. In particular, antisense compounds can be designed to specifically hybridize 
to a particular region of the gene sequence or mRNA of one or more of the interacting 
protein members to modulate (increase or decrease), replication, transcription, or 
translation. As used herein, the term "specifically hybridize" or paraphrases thereof 

30 means a sufficient degree of complementarity or pairing between an antisense oligo and a 
target DNA or mRNA such that stable and specific binding occurs therebetween. In 
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particular, 100% complementary or pairing is not required. Specific hybridization takes 
place when sufficient hybridization occurs between the antisense compound and its 
intended target nucleic acids in substantially absence of non-specific binding of the 
antisense compound to non-target sequences under predetermined conditions, e.g., for 
5 purposes of in vivo treatment, preferably under physiological conditions. Preferably, 
specific hybridization results in the interference with normal expression of the target 
DNAormRNA. 

For example, an antisense oligo can be designed to specifically hybridize to the 
replication or transcription regulatory regions of a target gene, or the translation 
10 regulatory regions such as translation initiation region and exon/intron junctions, or the 
coding regions of a target mRNA. Preferably, TsglOl gene or TsglOl mRNA is used as 
the target. 

As is generally known in the art, commonly used oligonucleotides are oligomers 
or polymers of ribonucleic acid or deoxyribonucleic acid having a combination of 

IS naturally-occurring nucleoside bases, sugars and covalent linkages between nucleoside 
bases and sugars including a phosphate group. However, it is noted that the term 
"oligonucleotides" also encompasses various non-naturally occurring roimelics and 
derivatives, i.e., modified forms, of naturally-occurring oligonucleotides as described 
below. Typically an antisense compound of the present invention is an oligonucleotide 

20 having from about 6 to about 200, preferably from about 8 to about 30 nucleoside bases. 

The antisense compounds preferably contain modified backbones or non-natural 
intemucleoside linkages, including but not limited to, modified phosphorous-containing 
backbones and non-phosphorous backbones such as morpholino backbones; siloxane, 
sulfide, sulfoxide, sulfone, sulfonate, sulfonamide, and sulfamate backbones; formacetyl 

25 and thioformacetyl backbones; alkene-containing backbones; methyleneimino and 
methylenehydrazino backbones; amide backbones, and the like. 

Examples of modified phosphorous-containing backbones include, but are not 
limited to phosphorothioates , phosphorodi thioates , chiral phospharothioates, 
phosphotriesters, aminoalkylphosphotriesters, alkyl phosphorates, 

30 thionoalkylphosphonates, phosphinates, phosphoramidates, thionophosphoramidates, 
thionoalkylphosphotriesters, and boranophosphates and various salt forms thereof. See 
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e.g.> U.S. Pat Nos. 3,687,808; 4,469,863; 4,476,301; 5,023,243; 5,177,196; 5,188,897; 

5,264,423; 5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 5,405,939; 5,453,496; 

5,455,233; 5,466,677; 5,476,925; 5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 

5,571,799; 5,587,361; and 5,625,050, each of which is herein incosrporated by reference. 
5 Examples of the non-phosphorous containing backbones described above are 

disclosed in, e.g., U.S. Pat Nos. 5,034,506; 5,185,444; 5,214,134; 5,216,141; 5,235,033; 

5,264,562; 5,264,564; 5,405,938; 5,434,257; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 

5,596,086; 5,610,289; 5,602,240; 5,608,046; 5,610,289; 5,618,704; 5,623,070; 5,663,312; 

5,677,437; and 5,677,439, each of which is herein incorporated by reference. 
10 Another useful modified oligonucleotide is peptide nucleic acid (PNA), in which 

the sugar-backbone of an oligonucleotide is replaced with an amide containing backbone, 

e.g., an aminoethylglycine backbone. See U.S. Patent Nos. 5,539,082 and 5,714,331; and 

Nielsen et <d., Science, 254, 1497-1500 (1991), all of which are incorporated herein by 

reference. PNA antisense compounds are resistant to RNAse H digest and thus exhibit 
15 longer half-life. In addition, various modifications may be made in PNA backbones to 

impart desirable drug profiles such as better stability, increased drug uptake, higher 

affinity to target nucleic acid, etc. 

Alternatively, the antisense compounds are oligonucleotides containing modified 

nucleosides, i.e., modified purine or pyrimidine bases, e.g., 5 -substituted pyrimidines, 6- 
20 azapyrimidines, and N-2, N-6 and O-substituted purines, and the like. See e.g., U.S. Pat 

Nos. 3,687,808; 4,845,205; 5,130302; 5,175,273; 5,367,066; 5,432,272; 5,459,255; 

5,484,908; 5,502,177; 5,525,711; 5,587.469; 5,594,121; 5,596,091; 5,681,941; and 

5,750,692, each of which is incorporated herein by reference in its entirety. 

In addition, oligonucleotides with substituted or modified sugar moieties may also 
25 be used. For example, an antisense compound may have one or more 2' -O-methoxyethyl 

sugar moieties. See e.g., U.S. Pat Nos. 4,981,957; 5,118,800; 5319,080; 5,393,878; 

5,446,137; 5,466,786; 5,514,785; 5,567,811; 5,576,427; 5,591,722; 5,610,300; 

5,627,0531 5,639,873; 5,646,265; 5,658,873; 5,670,633; and 5,700,920, each of which is 

herein incorporated by reference. 
30 Other types of oligonucleotide modifications are also useful including linking an 

oligonucleotide to a lipid, phospholipid or cholesterol moiety, cholic acid, thioether, 
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aliphatic chain, polyamine, polyethylene glycol (PEG), or a protein or peptide. The 
modified oligonucleotides may exhibit increased uptake into cells, improved stability, 
i.e., resistance to nuclease digestion and other biodegradations. See e.g., U.S. Patent No. 
4,522,811; Bornham, Am. J. Hasp. Pharm., 15:210-218 (1994). 
5 Antisense compounds can be synthesized using any suitable methods known in 

the art In fact, antisense compounds may be custom made by commercial suppliers. 
Alternatively, antisense compounds may be prepared using DNA synthesizers 
commercially from various vendors, e.g., Applied Biosystems Group of Norwalk, CT. 
The antisense compounds can be formulated into a pharmaceutical composition 

10 with suitable carriers and administered into cells or tissue in vitro or in a patient using 

any suitable route of administration. Alternatively, the antisense compounds may also be 
used in a "gene-therapy" approach. That is, the oligonucleotide is subcloned into a 
suitable vector and transformed into human cells. Hie antisense oligonucleotide is then 
produced in vivo through transcription. Methods for gene therapy are disclosed in 

15 Section 6.3.2 below. 

53. Ribozyme Therapy 

In another embodiment, an enzymatic RNA or ribozyme is designed to target the 
nucleic acids encoding one or more of the interacting protein members of the protein 

20 complex of the present invention. In preferred embodiments, TsglOl nucleic acids are 
targeted. Ribozymes are RNA molecules, which have an enzymatic activity and are 
capable of repeatedly cleaving other separate RNA molecules in a nucleotide base 
sequence specific manner. See Kim et al 9 Proc. Nail Acad. ofScu USA* 84:8788 (1987); 
Haseloff and Gerlach, Nature, 334:585 (1988); and Jefferies et al., Nucleic Acid Res., 

25 17:1371 (1989). A ribozyme typically has two portions: a catalytic portion andabinding 
sequence that guides the binding of ribozymes to a target RNA through complementary 
base-pairing. Once the ribozyme is bound to a target RNA, it enzymatically cleaves the 
target RNA, typically destroying its ability to direct translation of an encoded protein. 
After a ribozyme has cleaved its RNA target, it is released from that target RNA and 

30 thereafter can bind and cleave another target That is, a single ribozyme molecule can 
repeatedly bind and cleave new targets. Therefore, one advantage of ribozyme treatment 



77 



WO 02/072790 



PCT/US02/08146 



is that a lower amount of exogenous RNA is required as compared to conventional 
anti sense therapies. In addition, ribozymes exhibit less affinity to mRNA targets than 
DNA-based aziti sense oligos, and therefore ate less prone to bind to wrong targets. 

In accordance with the present invention, a ribozyme may target any portions of 
5 the mRNA of one or more interacting protein members including TsglOl and HIV GAG. 
Methods for selecting a ribozyme target sequence and designing and making ribozymes 
are generally known in the art See e.g., U.S. Patent Nos. 4,987,071; 5,496,698; 
5,525,468; 5,631,359; 5,646,020; 5,672,511; and 6,140,491, each of which is 
incorporated herein by reference in its entirety. For example, suitable ribozymes may be 

10 designed in various configurations such as hammerhead motifs, hairpin motifs, hepatitis 
delta virus motifs, group I intron motifs, or RNase P RNA motifs. See e.g., U.S. Patent 
Nos. 4,987,071; 5,496,698; 5,525,468; 5,631,359; 5,646,020; 5,672,511; and 6,140,491; 
Rossi et al., AIDS Res. Human Retroviruses 8:183 (1992); Hampel and Tritz, 
Biochemistry 28:4929 (1989); Hampel et oZ M Nucleic Acids Res., 18:299 (1990); Perrotta 

15 and Been, Biochemistry 31:16 (1992); and Guerrier-Takada etal., Cell, 35:849 (1983). 
Ribozymes can be synthesized by the same methods used for normal RNA 
synthesis. For example, such methods are disclosed in Usman et al., /. Am. Chenu Soc, 
109:7845-7854 (1987) and Scaringe etal., Nucleic Acids Res., 18:5433^5441 (1990). 
Modified ribozymes may be synthesized by the methods disclosed in, e.g., U.S. Pat No. 

20 5,652,094; International Publication Nos. WO 91/03162; WO 92/07065 and WO 

93/15187; European Patent Application No. 92110298.4; Perrault et al., Nature, 344:565 
(1990); Pieken et al., Science, 253:314 (1991); and Usman and Cedergren, Trends in 
Biochenu Sci., 17:334 (1992). 

Ribozymes of the present invention may be administered to cells by any known 

25 methods, e.g., disclosed in International Publication No. WO 94/02595. For example, 
they can be administered directly to cells or tissue in vitro or in a patient through any 
suitable route, e.g., intravenous injection. Alternatively, they may be delivered in 
encapsulation in liposomes, by iontophoresis, or by incorporation into other vehicles such 
as hydrogels, cyclodextrins, biodegradable nanocapsules, and bioadhesive microspheres. 

30 In addition, they may also be delivered by gene therapy approach, using a DNA vector 
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from which the zibozyme SNA can be transcribed directly. Gene therapy methods are 
disclosed in detail below in Section 6.3.2. 

5 A Competitive Inhibition 

S The patient concentration and activity of a particular protein complex and the 

interacting protein members thereof identified in accordance with the present invention 
may also be inhibited by various other methods. For example, compounds identified in 
accordance with the methods described in Section 4 that are capable of interfering with or 
dissociating protein-protein interactions between the interacting protein members of a 
10 protein complex may be administered to cells or tissue in vitro or in a patient 

Compounds identified in in vitro binding assays described in Section 4 that bind to the 
TsglOl -containing protein complex or the interacting members thereof may also be used 
in the treatment 

In addition, useful agents also include incomplete proteins, i.e., fragments of the 

1 5 interacting protein members that are capable of binding to their respective binding 
partners in a protein complex but are defective with respect to their cellular functions. 
For example, binding domains of the interacting member proteins of a protein complex 
may be used as competitive inhibitors of die activities of the protein complex. As will be 
apparent to skilled artisans, derivatives or homologues of the binding domains may also 

20 be used. Binding domains can be easily identified using molecular biology techniques, 
e.g., mutagenesis in combination with yeast two-hybrid assays. Preferably, the protein 
fragment used is a fragment of an interacting protein member having a length of less than 
90%, 80%, more preferably less than 75%, 65%, 50%, or less than 40% of the full length 
of the protein member. In one embodiment, a HTV GAGp6 protein fragment capable of 

25 binding TsglOl is administered. Far example, suitable protein fragments can include a 
polypeptide having a contiguous span of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 18, 20 or 
25, preferably from 4 to 30, 40 or 50 amino adds or more of the sequence of HIV 
GAGp6 and is capable of interacting with TsglOl . Also, suitable protein fragments can 
also include a peptide capable of binding TsglOl and having an amino acid sequence of 

30 from 4 to 30 amino acids that is at least 75%, 80%, 82%, 85%, 87%, 90%, 95% or more 
identical to a contiguous span of amino acids of HIV GAG or GAGp6 of the same length. 
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Alternatively, a polypeptide capable of interacting with BffV OAQp6 and having a 
contiguous span of 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 18, 20 or 25, preferably from 4 
to 30, 40 or 50 or more amino acids of the amino acid sequence of TsglOl may be 
administered Also, other examples of suitable compounds include a peptide capable of 
5 binding HIV GAGp6 and having an amino acid sequence of from 4 to 30, 40, 50 or more 
amino acids that is at least 75%, 80%, 82%, 85%, 87%, 90%, 95% or more identical to a 
contiguous span of amino acids of TsglOl amino acid sequence of the same length. In 
addition, the administered compounds can also be an antibody or antibody fragment, 
preferably single-chain antibody immunoreactive with TsglOl or HIV GAGp6 or protein 

10 complexes of the present invention. 

The protein fragments suitable as competitive inhibitors can be delivered into 
cells by direct cell internalization, receptor mediated endocytosis, cor via a "transporter-" 
It is noted that when the target proteins or protein complexes to be modulated reside 
inside cells, the compound administered to cells in vitro or in vivo in the method of the 

15 present invention preferably is delivered into the cells in order to achieve optimal results. 
Thus, preferably, the compound to be delivered is associated with a transporter capable of 
increasing the uptake of the compound by cells having the target protein or protein 
complex. As used herein, the team "transporter" refers to an entity (e.g., a compound or a 
composition or a physical structure formed from multiple copies of a compound or 

20 multiple different compounds) that is capable of facilitating die uptake of a compound of 
the present invention by animal cells, particularly human cells. Typically, the cell uptake 
of a compound of the present invention in the presence of a "transporter 11 is at least 20% 
higher, preferably at least 40%, 50%, 75%, and more preferably at least 100% higher than 
the cell uptake of the compound in the absence of the "transporter." 

25 Many molecules and structures known in the art can be used as "transporters." In 

one embodiment, a penetratin is used as a transporter. For example, the homeodomain of 
Antennapedia, a Drosophila transcription factor, can be used as a transporter to deliver a 
compound of the present invention* Indeed, any suitable member of the penetratin class 
of peptides can be used to carry a compound of the present invention into cells. 

30 Penetratins are disclosed in, e.g., Denossi et aL, Trends CeU Biol, 8:84-87 (1998), which 
is incorporated herein by reference. Penetratins transport molecules attached thereto 
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across cytoplasm membranes or nucleus membranes efficiently in a receptor- 
independent, energy-independent, and cell type-independent manner. Methods for using 
a penetratin as a carrier to deliver oligonucleotides and polypeptides are also disclosed in 
U.S. Patent No. 6,080,724; Pooga et al, Nat. Biotech., 16:857 (1998); and Schutze et al„ 
5 /. Immunol, 157:650 (1996), all of which are incorporated herein by reference. U.S. 
Patent No. 6,080,724 defines the miTrim*! requirements for a penetratin peptide as a 
peptide of 16 amino acids with 6 to 10 of which being hydrophobic. The amino acid at 
position 6 counting from either the N- or C-terminal is tryptophan, while the amino acids 
at positions 3 and 5 counting from either the N- or C-temiinal are not both valine. 

10 Preferably, the helix 3 of the homeodomain of Drosopliila Antennapedia is used as a 
transporter. More preferably, a peptide having a sequence of the amino acids 43*58 of 
the homeodomain Antp is employed as a transporter. In addition, other naturally 
occurring homologs of the helix 3 of the homeodomain of Drosophila Antennapedia can 
also be used. For example, homeodomains of Fushi-tarazu and Engrailed have been 

15 shown to be capable of transporting peptides into cells. See Han et al., Mol Cells, 
10:728-32 (2000). As used herein, the term "penetratin" also encompasses peptoid 
analogs of the penetratin peptides. Typically, the penetratin peptides and peptoid analogs 
thereof are covalently linked to a compound to be delivered into cells thus increasing the 
cellular uptake of the compound. 

20 In another embodiment, the HIV-1 tat protein or a derivative thereof is used as a 

"transporter" covalently linked to a compound according to the present invention. The 
use of HIV-1 tat protein and derivatives thereof to deliver macromolecules into cells has 
been known in the art. See Green and Loewenstein, Celt, 55:1 179 (1988); Frankel and 
Pabo, Cell, 55:1189 (1988); Vives et al, J. Biol Chem., 272:16010-16017 (1997); 

25 Schwarze et al, Science, 285:1569-1572 (1999). It is known that the sequence 

responsible for cellular uptake consists of the highly basic legion, amino acid residues 
49-57. See e.g., Vives et al, J. Biol Chenu, 272:16010-16017 (1997); Wender etal, 
Proc. NaflAcad. Set USA, 97:13003-13008(2000). The basic domain is believed to 
target die lipid bilayer component of cell membranes. It causes a covalently linked 

30 protein or nucleic acid to cross cell membrane rapidly in a cell type-independent manner. 
Proteins ranging in size from 15 to 120 kD have been delivered with this technology into 
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a variety of cell types both in vitro and in vivo. See Schwarze et aL, Science, 285:1569- 
1572 (1999). Any HIV tat-derived peptides or peptoid analogs thereof capable of 
transporting macromolecules such as peptides can be used for purposes of the present 
invention. For example, any native tat peptides having the highly basic region, amino 
5 acid residues 49-57 can be used as a transporter by covalently linking it to the compound 
to be delivered. In addition, various analogs of the tat peptide of amino acid residues 49- 
57 can also be useful transporters for purposes of this invention. Examples of various 
such analogs are disclosed in Wender et aL, Proc. Nat'l Acad. ScL USA, 97: 13003-13008 
(2000) (which is incorporated herein by reference) including, e.g., d-Tat49-57, retro- 

10 inverso isomers of /- or d-Tat^s? (i.e., /-Tat57-49 and rf-Tats?^), L-arginine oligomers, D- 
arginine oligomers, L-lysine oligomers, D-lysine oligomers, L-histine oligomers, D- 
histine oligomers, L-ornithine oligomers, D-orni thine oligomers, and various 
homologues, derivatives (e.g M modified forms with conjugates linked to the small 
peptides) and peptoid analogs thereof. 

IS Other useful transporters known m the art include, but are not limited to, short 

peptide sequences derived from fibroblast growth factor (See Lin et aL, J. Biol Chenu, 
270:14255-14258 (1998)), Galparan (SeePoogzeta!., FASEBJ. 12:67-77 (1998)), and 
HSV-1 structural protein VP22 (JSee Elliott and OHare, Cell, 88:223-233 (1997)). 
As the above-described various transporters are generally peptides, fusion 

20 proteins can be conveniently made by recombinant expression to contain a transporter 
peptide covalently linked by a peptide bond to a competitive protein fragment 
Alternatively, conventional methods can be used to chemically synthesize a transporter 
peptide or a peptide of the present invention or both. 

The hybrid peptide can be administered to cells in vitro or to a patient in a 

25 suitable pharmaceutical composition as provided in Section 5. 

In addition to peptide-based transporters, various other types of transporters can 
also be used, including but not limited to cationic liposomes (see Rui et aL, J. Am. Cheiru 
Soc., 120:11213-11218 (1998)), dendrimers (Kono etaL, Bioconjugate Clienu, 10:1115- 
1121 (1999)), siderophores (Ghosh etal., Chem. BioL, 3:1011-1019 (1996)), etc. In a 

30 specific embodiment, the compound according to the present invention is encapsulated 
into liposomes for delivery into cells. 
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Additionally, when a compound according to the present invention is a peptide, it 
can be administered to cells by a gene therapy method. That is, a nucleic acid encoding 
the peptide can be administered to in vitro cells or to cells in vivo in a human or animal 
body. Any suitable gene therapy methods may be used for purposes of the present 
5 invention. Various gene therapy methods are well known in the art and are described in 
Section 6.3.2. below. Successes in gene therapy have been reported recently. See e.g., 
Ksyetal, Nature Genet., 24:257-61 (2000); Cavazzana-Calvo et aL. Science, 288:669 
(2000); and Blaese et at, Science, 270: 475 (1995); Kantoff, et al., 7, Exp. Med., 166:219 
(1987). 

10 

5.5. Gene Therapy 

In yet another embodiment, the gene therapy approach is taken to "knock out* 1 the 
gene encoding TsglOl, or to reduce the gene expression level. For example, the gene 
may be replaced with a different gene sequence or a non-functional sequence or simply 

15 deleted by homologous recombination. In another gene therapy embodiment, the method 
disclosed in U.S. Patent No. 5,641,670, which is incorporated herein by reference, may 
be used to reduce the expression of the TsglOl gene. Essentially, an exogenous DNA 
having at least a regulatory sequence, an exon and a splice donor site can be introduced 
into an endogenous gene encoding TsglOl by homologous recombination such that the 

20 regulatory sequence, the exon and the splice donor site present in the DNA construct 
become operari vely linked to die endogenous gene. As a result, the expression of the 
endogenous gene is controlled by die newly introduced exogenous regulatory sequence. 
Therefore, when the exogenous regulatory sequence is a strong gene expression 
repressor, the expression of the endogenous gene encoding the interacting protein 

25 member is reduced or blocked. See U.S. Patent No. 5,641,670. 

Various gene therapy methods are well known in the art Successes in gene 
therapy have been reported recently. See e.g., Kay et aL, Nature Genet., 24:257-61 
(2000); Cavazzana-Calvo et aL, Science, 288:669 (2000); and Blaese et at., Science, 270: 
475 (1995); Kantoff, et al., J. Exp. Med. 166:219 (1987). 

30 Any suitable gene therapy methods may be used for purposes of the present 

invention. Generally, a gene therapy vector is used to cany the nucleic acids (exogenous 
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nucleic acids) useful in modifying the endogenous TsglOl gene. The vector typically 
includes nucleic acid sequences that can direct site-specific homologous recombination. 
For example, an exogenous nucleic acid encoding a defective TsglOl protein, e.g., 
incapable of binding HIV QAGp6, is preferably carried within the gene therapy vector. 
5 Alternatively, the vector may contain sequences corresponding to the two ends of the 
endogenous TsglOl gene which can cause homologous recombination thereby "knocking 
out" the endogenous TsglOl gene. 

In one embodiment, the exogenous nucleic acid (gene) is incorporated into a 
pla&mid DNA vector. Many commercially available expression vectors may be useful for 

10 the present invention, including, e.g., pCEP4, pcDNAI, pIND, pSecTag2, pVAXl, 
pcDNA3.1, and pBI-EGFP, and pDisplay . 

Various viral vectors may also be used. Typically, in a viral vector, the viral 
genome is engineered to eliminate the disease-causing capability, e.g., the ability to 
replicate in the host cells. The exogenous nucleic acid to be introduced into a patient 

15 may be incorporated into the engineered viral genome, e.g., by inserting it into a viral 
gene that is non-essential to the viral infectivity. Viral vectors are convenient to use as 
they can be easily introduced into tissue cells by way of infection. Once in the host cell, 
the recombinant virus typically is integrated into the genome of the host cell. In rate 
instances, the recombinant virus may also replicate and remain as extrachromosomal 

20 elements. 

A large number of retroviral vectors have been developed for gene therapy. 
These include vectors derived from oncoretroviruses (e.g., MLV), lentiviruses (e.g., HIV 
and SIV) and other retroviruses. For example, gene therapy vectors have been developed 
based on murine leukemia virus (See, Cepko, et cd. 9 Cell, 37:1053-1062 (1984), Cone and 

25 Mulligan, Proc. Natl Acad. Sci. U.SA., 81:6349-6353 (1984)), mouse mammary tumor 
virus (See, Salmons et aUBiochem. Biopkys. Res. Commun.,159:l 191-1198 (1984)), 
gibbon ape leukemia virus (See 9 Miller et al., J. Virology, 65:2220-2224 (1991)), HIV, 
(See Shimada et al. 9 J. Clin. Invest., 88:1043-1047 (1991)), and avian retroviruses (See 
Cosset etaL, J. Virology, 64:1070-1078 (1990)). In addition, various retroviral vectors 

30 are also described in U.S. Patent Nos. 6,168,916; 6,140,1 11; 6,096,534; 5,985,655; 

5,911,983; 4,980,286; and 4,868, 116, all of which are incorporated herein by reference. 
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Adeno-associated virus (AAV) vectors have been successfully tested in clinical 
trials. See e.g., Kay et al, Nature Genet. 24:257-61 (2000). AAV is a naturally 
occurring defective virus that requires other viruses such as adenoviruses or herpes 
viruses as helper viruses. See Muzyczka, Curr. Top. Microbiol Immun., 158:97 (1992), 
5 A recombinant AAV virus useful as a gene therapy vector is disclosed in U.S. Patent No. 
6,153,436, which is incorporated herein by reference. 

Adenoviral vectors can also be useful for purposes of gene therapy in accordance 
with the present invention. For example, U.S. Patent No. 6,001,816 discloses an 
adenoviral vector, which is used to deliver a leptin gene intravenously to a mammal to 
10 treat obesity. Other recombinant adenoviral vectors may also be used, which include 
those disclosed in U.S. Patent Nos. 6,171,855; 6,140,087; 6,063,622; 6,033,908; and 
5,932,210, and Rosenfeld et a/., Science. 252:43 1-434 (1991); and Rosenfeld et al., Cell, 
68:143-155 (1992). 

Other useful viral vectors include recombinant hepatitis viral vectors (See, e.g., 

15 U.S. Patent No. 5,98 1,274), and recombinant entomopox vectors (See, e.g., U.S. Patent 
Nos. 5,721,352 and 5,753,258). Non-traditional vectors may also be used for purposes of 
this invention. For example, International Publication No. WO 94/18834 discloses a 
method of delivering DNA into mammalian cells by conjugating the DNA to be delivered 
with a polyelectrolyte to form a complex. The complex may be microinjected into or 

20 uptaken by cells. 

The exogenous nucleic acids may also be introduced into cells by way of 
receptor-mediated endocytosis. See e.g., U.S. Patent No. 6,090,619; Wu and Wu, J. Biol 
Chem., 263:14621 (1988); Curiel et ol., Proc. Natl Acad. ScL USA, 88:8850 (1991). For 
example, U.S. Patent No. 6,083,741 discloses introducing an exogenous nucleic acid into 

25 mammalian cells by associating the nucleic acid to a polycation moiety (e.g., poly-L- 
lysine having 3-100 lysine residues), which is itself coupled to an integrin receptor 
binding moiety (e.g., a cyclic peptide having the sequence RGD). 

Alternatively, the exogenous nucleic acids or vectors containing it can also be 
delivered into cells via amphiphiles. See e.g., U.S. Patent No. 6,071,890. Typically, the 

30 exogenous nucleic acid or a vector containing the nucleic acid forms a complex with the 
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cationic amphiphile. Mammalian cells contacted with the complex can readily take the 
complex up. 

The exogenous nucleic acids can be introduced into a patient for purposes of gene 
therapy by various methods known in the art For example, the exogenous gene 
5 sequences alone or in a conjugated or complex form described above, or incorporated 
into viral or DNA vectors, may be administered directly by injection into an appropriate 
tissue or organ of a patient Alternatively, catheters or like devices may be used for 
delivery into a target organ or tissue. Suitable catheters are disclosed in, e.g., U.S. Patent 
Nos. 4,186,745; 5397,307; 5,547,472; 5,674,192; and 6,129,705, all of which aie 
10 incorporated herein by reference. 

In addition, the exogenous nucleic acids or vectors containing the nucleic acids 
can be introduced into isolated cells using any known techniques such as calcium 
phosphate precipitation, microinjection, lipofection, electroporation, gene gun, receptor- 
mediated endocytosis, and the like. Cells expressing the exogenous nucleic acids may be 
15 selected and redelivered back to the patient by, e.g., injection or cell transplantation. The 
appropriate amount of cells delivered to a patient will vary with patient conditions, and 
desired effect, which can be determined by a skilled artisan. See e.g., U.S. Patent Nos. 
6,054,288; 6,048,524; and 6,048,729. Preferably, die cells used are autologous, i.e., cells 
obtained from the patient being treated. 

20 

6. Pharmaceutical Compositions and Formulations 
In another aspect of the present invention, pharmaceutical compositions are also 
provided containing one or more of the therapeutic agents provided in the present 
invention as described in Section 6. For example, such therapeutic agents include, but 

25 are not limited to, (1) small organic compounds selected based on the screening methods 
of the present invention capable of interfering with the interaction between TsglOl and 
HIV GAG or GAGp6 or another retroviral GAG containing a P(T/S)AP late domain 
motif, (2) antisense compounds specifically hybridizable to TsglOl nucleic acids (gene or 
mRNA), (3) ribozyme compounds specific to TsglOl nucleic acids (gene or mRNA), (4) 

30 antibodies immunoreactive with TsglOl or HIV GAG or GAGp6 or another retroviral 
GAG containing a P(T/S)AP late domain motif, (5) antibodies selectively 
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immunoreactive with a protein complex of the present invention, (6) small organic 
compounds capable of binding a protein complex of the present invention. (7) small 
peptide compounds as described above (optionally linked to a transporter) capable of 
interacting with TsglOl or a retroviral GAG polypeptide, (8) nucleic acids encoding the 
S antibodies or peptides, etc. Hie compositions are prepared as a pharmaceutical 

formulation suitable for administration into a patient Accordingly, the present invention 
also extends to pharmaceutical compositions, medicaments, drugs or other compositions 
containing one or more of the therapeutic agent in accordance with the present invention. 
In the pharmaceutical composition, an active compound identified in accordance 

10 with die present invention can be in any pharmaceutically acceptable salt form. As used 
herein, the term "pharmaceutically acceptable salts" refers to the relatively non-toxic, 
organic or inorganic salts of the compounds of the present invention, including inorganic 
or organic acid addition salts of the compound. Examples of such salts include, but are 
not limited to, hydrochloride salts, sulfate salts, bi sulfate salts, borate salts, nitrate salts, 

IS acetate salts, phosphate salts, hydrobromide salts, laurylsulfonate salts, glucoheptonate 
salts, oxalate salts, oleate salts, laurate salts, stearate salts, palmitate salts, valerate salts, 
benzoate salts, naththylate salts, mesylate salts, tosylate salts, citrate salts, lactate salts, 
maleate salts, succinate salts, tartrate salts, fumarate salts, and the like. See, e.g., Berge, et 
al., /. Pharnu Scu, 66:1-19 (1977). 

20 For oral delivery, the active compounds can be incorporated into a formulation 

that includes pharmaceutically acceptable carriers such as binders (e.g., gelatin, cellulose, 
gum tragacanth), excipients (e.g., starch, lactose), lubricants (e.g., magnesium stearate, 
silicon dioxide), disintegrating agents (e.g., alginate, Primogel, and corn starch), and 
sweetening or flavoring agents (e.g., gjucose, sucrose, saccharin, methyl salicylate, and 

25 peppermint). The formulation can be orally delivered in the form of enclosed gelatin 

capsules or compressed tablets. Capsules and tablets can be prepared in any conventional 
techniques. The capsules and tablets can also be coated with various coatings known in 
the art to modify the flavors, tastes, colors, and shapes of the capsules and tablets, hi 
addition, liquid carriers such as fatty oil can also be included in capsules. 

30 Suitable oral formulations can also be in the form of suspension, syrup, chewing 

gum, wafer, elixir, and the like. If desired, conventional agents for modifying flavors, 
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tastes, colors, and shapes of the special forms can also be included. In addition, for 
convenient administration by enteral feeding tube in patients unable to swallow, the 
active compounds can be dissolved in an acceptable lipophilic vegetable oil vehicle such 
as olive oil, corn oil and 8 af flower oil. 

5 Hie active compounds can also be administered parenterally in the form of 

solution or suspension, or in lyophilized form capable of conversion into a solution or 
suspension form before use. In such formulations, diluents or pharmaceutically 
acceptable carriers such as sterile water and physiological saline buffer can be used. 
Other conventional solvents, pH buffers, stabilizers, anti-bacteria agents, surfactants, and 

10 antioxidants can all be included. For example, useful components include sodium 

chloride, acetates, citrates or phosphates buffers, glycerin, dextrose, fixed oils, methyl 
parabens, polyethylene glycol, propylene glycol, sodium bisulfate, benzyl alcohol, 
ascorbic acid, and the like. The parenteral formulations can be stored in any conventional 
containers such as vials and ampoules. 

IS Routes of topical administration include nasal, bucal, mucosal, rectal, or vaginal 

applications. For topical administration, the active compounds can be formulated into 
lotions, creams, ointments, gels, powders, pastes, sprays, suspensions, drops and aerosols. 
Thus, one or more thickening agents, humectants, and stabilizing agents can be included 
in the formulations. Examples of such agents include, but are not limited to, 

20 polyethylene glycol, sorbitol, xanthan gum, petrolatum, beeswax, or mineral oil, lanolin, 
squalene, and the like. A special form of topical administration is delivery by a 
transdermal patch. Methods for preparing transdermal patches are disclosed, e.g., in 
Brown, et al., Annual Review of Medicine, 39:221-229 (1988), which is incorporated 
herein by reference. 

25 Subcutaneous implantation for sustained release of the active compounds may 

also be a suitable route of administration. Hits entails surgical procedures for implanting 
an active compound in any suitable formulation into a subcutaneous space, e.g., beneath 
the anterior abdominal wall. See, e.g., Wilson et of., /. Clin. Psych. 45:242-247 (1984). 
Hydrogels can be used as a carrier for the sustained release of the active compounds. 

30 Hydrogels are generally known in the art They are typically made by crosslinking high 
molecular weight biocompatible polymers into a network, which swells in water to form 
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a gel like material. Preferably, hydrogels is biodegradable or biosorbable. For purposes 
of this invention, hydrogels made of polyethylene glycols, collagen, or poly(glycolioco- 
L-lactic acid) may be useful. See, e.g., Phillips et aL, / Pharmaceut Set 73:1718-1720 
(1984). 

5 The active compounds can also be conjugated, to a water soluble non- 

immunogenic non-peptidic high molecular weight polymer to form a polymer conjugate. 
For example, an active compound is covalently linked to polyethylene glycol to form a 
conjugate. Typically, such a conjugate exhibits improved solubility, stability, and 
reduced toxicity and immunogenicity. Thus, when administered to a patient; the active 

10 compound in the conjugate can have a longer half-life in the body, and exhibit better 

efficacy. See generally, Burnham, Am. J. Hosp. Pharm. t 15:210-218 (1994), FEGylated 
proteins are currently being used in protein replacement therapies and for other 
therapeutic uses. For example, PBGylated interferon (PEG-INTRON A®) is clinically 
used for treating Hepatitis B. PBGylated adenosine deaminase (ADAGEN®) is being 

15 used to treat severe combined immunodeficiency disease (SCIDS). PEGylatedL- 
asparaginase (ONC APSPAR®) is being used to treat acute lymphoblastic leukemia 
(ALL). It is preferred that the covalent linkage between the polymer and the active 
compound and/or the polymer itself is hydrolytically degradable under physiological 
conditions. Such conjugates known as "prodrugs" can readily release the active 

20 compound inside the body. Controlled release of an active compound can also be 
achieved by incorporating the active ingredient into microcapsules, nanocapsules, or 
hydrogels generally known in die art 

Liposomes can also be used as carriers for the active compounds of the present 
invention. liposomes are micelles made of various lipids such as cholesterol, 

25 phospholipids, fatty acids, and derivatives thereof. Various modified lipids can also be 
used. liposomes can reduce the toxicity of the active compounds, and increase their 
stability. Methods for preparing liposomal suspensions containing active ingredients 
therein are generally known in the art See, e.g., U.S. Patent No. 4,522,81 1; Prescott, Ed, 
Methods in Cell Biology, Volume XIV, Academic Press, New York, N.Y. (1976). 

30 The active compounds can also be administered in combination with another 

active agent that synergistically treats or prevents the same symptoms or is effective for 
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another disease or symptom in the patient treated so long as the other active agent does 
not interfere with or adversely affect the effects of die active compounds of this 
invention. Such other active agents include but are not limited to anti-inflammation 
agents, antiviral agents, antibiotics, antifungal agents, antithrombotic agents, 
5 cardiovascular drugs, cholesterol lowering agents, anti-cancer drags, hypertension drugs, 
and the like. 

Generally, the toxicity profile and therapeutic efficacy of the therapeutic agents 
can be determined by standard pharmaceutical procedures in cell models or animal 
models, e.g., those provided in Section 7. As is known in the art, the LD S0 represents the 

10 dose lethal to about 50% of a tested population. The ED50 is a parameter indicating the 
dose therapeutically effective in about 50% of a tested population. Both LD 50 and ED50 
can be determined in cell models and animal models. In addition, the IC50 may also be 
obtained in cell models and animal models, which stands for the circulating plasma 
concentration that is effective in achieving about 50% of the Tmnrinuii inhibition of the 

15 symptoms of a disease or disorder. Such data may be used in designing a dosage range 
for clinical trials in humans. Typically, as will be apparent to skilled artisans, die dosage 
range for human use should be designed such that the range centers around the ED50 
and/or IC 50 , but significantly below the LD50 obtained from cell or animal models. 
It will be apparent to skilled artisans that therapeutically effective amount for each active 

20 compound to be included in a pharmaceutical composition of the present invention can 
vary with factors including but not limited to the activity of the compound used, stability 
of the active compound in the patient's body, the severity of the conditions to be 
alleviated, the total weight of the patient treated, the route of administration, the ease of 
absorption, distribution, and excretion of the active compound by the body, the age and 

25 sensitivity of the patient to be treated, and the like. The amount of administration can 
also be adjusted as the various factors change over time. 

EXAMPLES 

1. Yeast Two-Hvfarid System 
30 The principles and methods of die yeast two-hybrid system have been described 

in detail in The Yeast Two-Hybrid System, Battel and Fields, eds., pages 183-196, Oxford 
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University Press, New York, NY, 1997. The following is thus a description of the 
particular procedure that we used. 

The cDNA encoding the bait protein HIV GAQp6 was derived from HTV-1 
NY5/BRU isolate. The cDNA product was then introduced by recombination into the 
5 yeast expression vector pGBT.Q, which is a close derivative of pGBT.C {See Battel et. 
alt Nat Genet., 12:72-77 (1996)) in which the polylinker site has been modified to 
include Ml 3 sequencing sites. The new construct was selected directly in the yeast strain 
PNY200 for its ability to drive tryptophane synthesis (genotype of this strain; MATa 
trpl-901 teu2-3J12 ura3-52 fus3-200 ade2 gal4A gal80). In these yeast cells, the bait 

10 was produced as a C-terminal fusion protein with the DNA binding domain of die 
transcription factor Gal4 (amino acids 1 to 147). 

Prey libraries (e.g., a human spleen cDNA library) were transformed into the 
yeast strain BK100 (genotype of this strain: MATa trpl-901 leu2-3,112 ura3-52 his3-200 
gal4A gal80 LYS2::GAL-HIS3 GAL2-ADE2 met2::GAL7-lacZ), and selected for the 

15 ability to drive leucine synthesis. In these yeast cells, each cDNA was expressed as a 
fusion protein with the transcription activation domain of the transcription factor Gal4 
(amino acids 768 to 881) and a 9 amino acid hemagglutinin epitope tag. PNY200 cells 
(MATa mating type), expressing the bait, were then mated with BK100 cells (MATa 
mating type), expressing prey proteins from the prey library. The resulting diploid yeast 

20 cells expressing proteins interacting with the bait protein were selected for the ability to 
synthesize tryptophan, leucine, histidine, and adenine. DNA was prepared from each 
clone, transformed by electroporation into E. coli strain KC8 (Clontech KC8 
electrocompetent cells, Catalog No. C2023-1), and the cells were selected on ampicillin- 
containing plates in the absence of either tryptophane (selection for the bait plasmid) or 

25 leucine (selection for the library plasmid). DNA for both plasmids was prepared and 
sequenced by the dideoxynucleotide chain termination method. The identity of the bait 
cDNA insert was confirmed and the cDNA insert from the prey library plasmid was 
identified using the BLAST program to search against public nucleotide and protein 
databases. Plasmids from the prey library were then individually transformed into yeast 

30 cells together with a plasmid driving the synthesis of lamin and 5 other test proteins, 

respectively, fused to the Gal4 DNA binding domain. Clones that gave a positive signal 
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in the (3-galactosidase assay were considered false-positives and discarded. Plasmids for 
the remaining clones were transformed into yeast cells together with the original bait 
plasmid* Clones that gave a positive signal in the P-galactosidase assay were considered 
true positives. 

5 HTV GAGp6 sequence indicated in Table 1 was used in the yeast two-hybrid 

system described above. The isolated TsglOl prey sequences are summarized in Table 1. 
The QenBank Accession Nos. for the bait and prey proteins axe also provided in Table 1, 
upon which the bait and prey sequences are aligned. 

2, Production of Antibodies Selectively Immunoreactive with Protein Complex 

10 HTV GAGp6 and the UEV domain of TsglOl are recombinantly expressed in 

human host cells and isolated and purified. A protein complex is formed by mixing the 
two purified interacting proteins (fragments). A protein complex is also formed by 
mixing recombinandy expressed intact complete TsglOl and HIV GAGp6. The two 
protein complexes are used as antigens in immunizing a mouse. mRNA is isolated from 

15 the immunized mouse spleen cells, and first-strand cDNA is synthesized based on the 
mRNA. The V H and Vk genes are amplified from the thus synthesized cDNAs by PCR 
using appropriate primers. 

The amplified Vh and Vk genes are ligated together and subcloned into a 
phagemid vector for the construction of a phage display library. K colL cells are 

20 transformed with the ligation mixtures, and thus a phage display library is established. 
Alternatively, the ligated V H and Vk genes are subcloned into a vector suitable for 
ribosome display in which the V H -V* sequence is under the control of a T7 promoter. See 
Schaffitzel etaL, J. Imnuau Meth, 231:119-135 (1999), 

The libraries axe screened with the TsglOl-HIV GAGp6 complex and individual 

25 TsglOl and HIV GAGp6. Several rounds of screening are preferably performed. Clones 
corresponding to scFv fragments that bind the TsglOl-EHV GAGp6 complex, but not the 
individual TsglOl and HTV GAGp6 are selected and purified. A single purified clone is 
used to prepare an antibody selectively immunoreactive with the TsglOl-HIV GAGp6 
complex. The antibody is then verified by an immunochenristry method such as RIA and 

30 KT ISA- 
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In addition, the clones corresponding to scFv fragments that bind the TsglOl-HTV 
GAGp6 complex and also binds TsglOl and/or HIV GAGp6 may be selected. The scPv 
genes in the clones arc diversified by mutagenesis methods such as oligonucleotide- 
directed mutagenesis, error-prone PCR (See Lin-Goerke et aL, Biotechniques, 23:409 
5 (1997)), dNTP analogues (See Zaccolo et ah, J. Mol Biol, 255:589 (1996)), and other 
methods. Hie diversified clones are further screened in phage display or ribosome 
display libraries. In this manner, scFv fragments selectively immunoreactive with the 
TsglOl-HIV GAGp6 complex may be obtained 

10 3. Correlations between T*pl01-mV flAOpfi Interaction smA Ifly ftn^p 

Yeast two-hybrid assays were utilized to determine the effect of amino acid 
substitution mutations in the PTAP motif of HIV GAGp6 on the interaction between 
TsglOl and GAGp6. To prepare a yeast two-hybrid activation domain-TsglOl construct, 
a DNA fragment encompassing the full-length coding sequence for TsglOl according to 

15 GenBank Accession No. U82130 was obtained by PCR from a human fetal brain cDNA 
library and cloned into the EcoRI/Pstl sites of the activation domain parent plasmid 
GADpN2 (LEU2, CEN4, ARS1, ADHlp-SV40NLS-GAL4 (768-88 1)-MCS (multiple 
cloning site>POKlt, AmpR, ColEl_ori). 

To prepare the yeast two-hybrid DNA binding domain-HIVl GAGp6 construct, a ' 

20 DNA fragment corresponding to the HTVl GAGp6 peptide derived from the HIV1 .NL43 
strain GAG protein was obtained by PCR from the NL43 containing plasmid R9 Aapa and 
was cloned into the EcoRI/Sall sites of the binding domain parent plasmid pGBT.Q. 

The following amino acid substitution mutations were introduced by PCR into the 
HIV1 GAGp6 sequence in the yeast two-hybrid binding domain-HIVl GAGp6 construct 

25 described above. Hie mutations were verified by DNA sequence analysis* Such 
mutations are summarized in Table 2 below. 
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Table 2. Tested Mutations in OAGp6 Protein 



Mutant 
Constnict 


OAGp6 Peptide Sequence Surrounding the PTAP Motif 


P6/wt 
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P6/P7L 
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P6/A9R 














R 


















P6/P10L 
















L 

















To test the effect of the mutations, yeast cells of the strain Y189 purchased from 
5 Clontech (ura3-52 his3*200 ade2-101 tip 1-901 leu2-3,112 met gal4 gal80 

URA3::GALlp-lacZ) were co-transformed with the activation domain-TsglOl construct 
and one of the binding domain-mutant GAGp6 constructs or die binding domain-wild 
type OAOp6 construct Filter lift assays for p-Gal activity were conducted by lifting the 
transformed yeast colonies with filters, lysing die yeast cells by freezing and thawing, 
10 and contacting the lysed cells with X-Gal. Positive p-Gal activity indicates that the 
GAGpfi wild type or mutant protein interacts with TsglOl. All binding domain 
constructs woe also tested for self-activation of p-Gal activity. The results are shown in 
Table 3. 

15 Table 3. Interactions Between TsglOl and GAGp6 





P67wt 


P67E6G 


P6/P7L 


P6/A9R 


P6/P10L 


TsglOl 
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P67wt 












P6VE6G 












P6VP7L 












P6VA9R 












P67P10L 
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Thus, as is clear from Table 3, the mutations in the PTAP motif of HIV GAGp6 
abolished the interaction between TsglOl and HIV OAOp6 f while the p6/E6G mutation 
outside the PTAP motif did not result in the elimination of the Tsgl01-GAGp6 
interaction. 

The interactions between TsglOl and wild-type GAGp6 (WT) or the GAGp6 
PTAP mutants were further quantitated by performing liquid culture 3-galactosidase 
assays. Cultures were grown overnight in synthetic media (-Leu, -Trp, + glucose) in 96 
well plates, normalized for optical density, and lysed by addition of 6X lysis/substrate 
solution in 6X Z-buffer (6QmM KC1, 6mM MgS0 4 , 360mM NaaHFO* 240 mM 
NaHzPO^ 6mg/ml CPRG, 0.12U/ml lyticase, 0.075% NIM0). Cultures were incubated 
for 2 hr at 37°C, clarified by centrifugation, and the optical absorbancc of each 
supernatant was measured (575 nm). Full length TsglOl bound wild-type p6 in the two- 
hybrid liquid culture assay, resulting in high levels of P-galactosidase activity (>300-fold 
over background). Three different p6 point mutants were used to test whether the TsglOl 
binding interaction required the PTAP late domain motif within HTV-1 p6, and all three 
(P6L> A9R and P10L) reduced 0-galactosidase activity to background levels. Each of 
these point mutations also arrests HTV-1 budding at a late stage (Huang et al. 1995). 
These results are consistent with the hypothesis that die interaction between HIV GAGp6 
and the human cellular protein TsglOl is essential for viral budding to occur. 

5 4. In Vitro Binding Assays 

A fusion protein with a GST tag fused to the HIV- 1 GAGp6 domain was 
recombinantly expressed and purified by chromatography. In addition, a GAGp6 peptide 
containing the first 14 amino acid residues ("p6(l-14)") was synthesized chemically by 
standard peptide synthesis methods. The peptide was purified by conventional protein 
10 purification techniques, e.g., by chromatography. 

Nunc/Nalgene Maxisorp plates were incubated overnight at 4°C or for 1-2 hrs at 
room temperature in 100 |il of a protein coupling solution containing purified GST-p6 
and 50mM Carbonate, pH=9,6. Tins allowed the attachment of the GST-p6 fusion 
protein to the plates. liquids in the plates were then emptied and wells filled with 400 
15 }il/well of a blocking buffer (SuperBlocfc; Pierce-Endogen, Rockford, DL). After 

95 



AVO 02/072790 



PCTAJS02/08146 



incubating for 1 hour at room temperature, 100 |il of a mixture containing Drosophila S2 
cell lysate myc-tagged TsglOl (residues 1-207) and a specific amount of the p6(l-14) 
peptide were applied to the wells of the plate. This mixture was allowed to react for 2 
hours at room temperature to form p6:Tsgl01 protein-protein complexes. 
5 Plates were then washed 4 x lOOfi] with 1 x PBST solution (Ihvitrogen; Carlsbad, CA). 
After washing, lOOpi of l^g/ml solution of anti-myc monoclonal antibody (done 9E10; 
Roche Molecular Biochemicals; Indianapolis, IN) in 1 x PBST was added to the wells of 
the plate to detect the myc-epitope tag on the TsglOl protein. Plates were then washed 
again with 4 x 100^1 with 1 x PBST solution and lOOfil of ljig/ml solution of horseradish 

10 peroxidase (HRP) conjugated Goat anti-mouse IgO (Jackson Immunoresearch Labs; 
West Grove, Pennsylvania) in 1 x PBST was added to the wells of the plate to detect 
bound mouse anti-myc antibodies. Plates were then washed again with 4 x lOOjil with 1 
x PBST solution and 100 JJJ of fluorescent substrate (QuantaBlu; Pierce-Endogen, 
Rockford, IL) was added to all wells. After 30 minutes, 100 jjd. of stop solution was 

IS added to each well to inhibit the function of HRP. Plates were then read on a Packard 
Fusion instrument at an excitation wavelength of 325nm and an emission wavelength of 
420nm. The presence of fluorescent signals indicates binding of TsglOl to the fixed 
GST-p6. In contrast, the absence of fluorescent signals indicates that the PX1X2P- 
containing short peptide is capable of disrupting the interaction between TsglOl and HIV 

20 p6. 

Different concentrations of the p6(l-14) peptide were tested, and the relative 
intensities of the fluorescence signals obtained at different concentrations were plotted 
against the peptide concentrations. The competitive inhibition curve is shown in Figure 
2. Two Dixon plots are shown in Hgure 3 and Figure 4, respectively. 

25 

5. Yeast Screen To Identify Small Mnl^te frt ybitors Of The Interaction Between HIV 
GAGn6 And TsglOl 

Beta-galactosidase is used as a reporter enzyme to signal die interaction between 
yeast two-hybrid protein pairs expressed from plasmids in Saccharoniyces cerevisiae. 
30 Yeast strain MY209 {qde2 his3 leu2 trpl cyh2 ura3::GALIp-lacZ gal4 gal80 

fys2::GALlp-HIS3) bearing the plasmids Mp364 (LEU2 CEN4 ARS1 ADHlp-SV40NLS~ 
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GAL4 (768-881)-Tsgl01 (l-390yPGKltAmpR ColEljm) andMp206 (TRP1 CEN4 
ARS ADHlp~GAL4(U14ThHIVJ^gag (448-5<X))-AZWJt AmpR ColEJjori) is cnltunaiin 
synthetic complete media lacking leucine and tryptophan (SC -Leu — Trp) overnight at 
30°C. This culture is diluted to 0.01 OD^o units/ml using SC -Leu -Trp media. Hie 
5 diluted MY209 culture is dispensed into 96-weIl microplates. Compounds from a library 
of small molecules are added to the microplates; the final concentration of test 
compounds is approximately 60fjtM The assay plates are incubated at 30°C overnight 
The following day an aliquot of concentrated substrate/lysis buffer is added to each well 
and the plates incubated at 37°C for 1 -2 hours. At an appropriate time an aliquot of stop 

10 solution is added to each well to halt the beta-galactosidase reaction. For all microplates 
an absorbance reading is obtained to assay the generation of product from the enzyme 
substrate. The presence of putative inhibitors of the interaction between HIV p6 and 
TsglOl results in inhibition of the beta-galactosidase signal generated by MY209. 
Additional testing eliminates compounds that decreased expression of beta-galactosidase 

15 by affecting yeast cell growth and non-specific inhibitors that affected the beta- 
galactosidase signal generated by the interaction of an unrelated protein pair. 

Once a hit, i.e., a compound which inhibits the interaction between the viral and 
cellular proteins, is obtained, the compound is identified and subjected to further testing 
wherein the compounds are assayed at several concentrations to determine an IC50 value, 

20 this being the concentration of the compound at which the signal seen in the two-hybrid 
assay described in this Example is 50% of the signal seen in the absence of the inhibitor. 

All publications and patent applications mentioned in the specification are 
indicative of die level of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same 

25 extent as if each individual publication or patent application was specifically and 
individually indicated to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of 
illustration and example for purposes of clarity of understanding, it will be obvious that 
certain changes and modifications may be practiced within the scope of the appended 

30 claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated protein complex comprising a first protein which is Tsg 10 1 or 
a homologue or derivative or fragment thereof interacting with a second protein which is 
a retrovirus GAG polypeptide containing the P(T/S)AP late domain motif or a homologue 

5 or derivative or fragment of said retrovirus GAG polypeptide. 

2. The isolated protein complex according to Claim 1, wherein said first 
protein is selected from the group consisting of 

(i) TsglOl protein, 

10 (ii) a TsglOl protein homologue having an amino acid sequence at least 

90% identical to that of TsglOl and capable of interacting with HIV GAGp6, 

Ciii) a TsglOl protein fragment containing the TsglOl UEV domain, and 
Civ) a fusion protein containing said TsglOl protein, said TsglOl protein 
homologue or said TsglOl protein fragment 

15 

3. The isolated protein complex according to Claim 1 or 2, wherein said 
second protein is selected from the group consisting of 

(1) a retrovirus GAG polypeptide having the P(T/S)AP late 

domain motif, 

20 (2) a homologue of said retrovirus GAG polypeptide, said homologue 

having an amino acid sequence at least 90% identical to that of said retrovirus GAG 
polypeptide and capable of interacting with TsglOl, 

(3) a fragment of said retrovirus GAG polypeptide, said fragment being 
capable of interacting with TsglOl, and 
25 (4) a fusion protein containing said retrovirus GAG polypeptide, said 

retrovirus GAG polypeptide homologue or said retrovirus GAG polypeptide 
fragment. 

4. The isolated protein complex of Claim 1, 2 or 3, wherein said retrovirus is 
30 alentivirus. 
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5. Hie isolated protein complex of Claim 4, wherein said lentivirus is a 
primate lentivirus. 

6. Hie isolated protein complex of Claim 5, wherein said primate lentivirus 
is selected from the group consisting of HIV- 1, HEV-2, fHV-3, and simian 
immunodeficiency viruses. 

7. The isolated protein complex of Claim 4, wherein said lentivirus is a non- 
primate lentivirus selected from the group consisting of bovine lend viruses, feline 
lentiviruses, and ovine/caprine lenti viruses. 

8. The isolated protein complex according to any of Claims 1-6, wherein said 
second protein is an HIV GAG polypeptide or a homologue or derivative or fragment 
thereof. 

9. The isolated protein complex according to any of Claims 1 -6 and 8, 
wherein said second protein is a fusion protein containing (a) an HIV GAG polypeptide 
or (b) an HTV GAG polypeptide fragment 

10. The isolated protein complex according to any of Claims 1-6, 8 and 9, 
wherein said second protein is HIV GAGp6 or a homologue or derivative or fragment 
thereof. 

11. The isolated protein complex according to any of Claims 1-6, 8, 9 and 10, 
wherein said second protein is a fusion protein containing (a) an HIV GAGp6 
polypeptide or (b) an HIV GAGp6 fragment. 

12. The isolated protein complex according to Claim 10 or 11, wherein said 
HIV GAGp6 fragment contains an amino acid sequence of SEQ ID NO:31 or SEQ ID 
NO:32. 
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13. Hie isolated protein complex of Claim 10 or 11, wherein said HIV 
GAGp6 fragment has a contiguous span of at least 10 amino add residues of a naturally 
occurring HIV GAGp6, said contiguous span containing a P(T/S)AP late domain motif. 

S 14. A method for making the protein complex according to any one of Claims 

1-13, comprising the steps of: 

providing said first protein and said second protein; and 
contacting said first protein with said second protein. 

10 IS. A protein microarray comprising the protein complex according to any 

one of Claims 1-13. 

16. A fusion protein having a first polypeptide covalently linked to a second 
polypeptide, wherein said first polypeptide is TsglOl or a homologue or fragment 

15 thereof, and wherein said second polypeptide is HIV GAG or a homologue or fragment 
thereof. 

17. An isolated nucleic acid encoding the fusion protein of Claim 16. 

20 18. A method for selecting modulators of a protein complex according any 

one of Claims 1-13, comprising: 

providing the protein complex; 

contacting said protein complex with a test compound; and 
determining the presence or absence of binding of said test compound to said 
25 protein complex. 

19. A method for selecting modulators of a protein complex according any 
one of Claims 1-13, said method comprising: 

contacting said first protein with said second protein in the presence of one or 
30 more test compounds; and 

detecting interaction between said first protein and said second protein. 
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20. A method far selecting modulators of a protein complex according any 
one of Claims 1-13, said method comprising: 

contacting said protein complex with a test compound; and 

detecting interaction between said first protein and said second protein. 

21. The method of Claim 18, 19 or 20, wherein at least one of said first and 
second proteins is a fusion protein having a detectable tag, 

22. The method according to any one of Claims 18-20, wherein said 
contacting step is conducted in a substantially cell free environment. 

23. The method of according to any one of Claims 1 8-20, wherein said 
contacting step is conducted in a host cell, 

24. The method of Claim 22, wherein said host cell is a yeast cell. 

25. A method for selecting modulators of the interaction between die first and 
second proteins of the protein complex of any one of Claims 1-13, comprising: 

providing in a host cell a first fusion protein having said first protein, and a 
second fusion protein having said second protein, wherein a DNA binding domain is 
fused to one of said first and second proteins while a transcription-activating d omain is 
fused to the other of said first and second proteins; 

providing in said host cell a reporter gene, wherein the transcription of the 
reporter gene is determined by the interaction between the first protein and the second 
protein; 

allowing said first and second fusion proteins to interact with each other within 
said host cell in the presence of a test compound; and 

determining the presence or absence of expression of said reporter gene. 

26. The method of Claim 25, wherein said host cell is a yeast cell. 
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27. A composition comprising: 

(a) a first expression vector having a nucleic acid encoding the first protein 
according to any one of Claims 1-13; and 
5 (b) a second expression vector having a nucleic acid encoding the second protein 

according to any one of Claims 1-13. 



28. A host cell comprising the first and second expression vectors of Claim 

27. 

10 

29. A host cell comprising: 

a first expres sion cassette having a first promoter operably linked to a first nucleic 
acid encoding the first protein according to any one of Claims 1-13; and 

a second expression cassette having a second promoter operably linked to a 
15 second nucleic acid encoding the second protein according to any one of Claims 1-13. 

30. The host cell of Claim 28 or 29, wherein said host cell is a yeast cell. 

3 1 . The host cell of Claim 28, 29 or 30, wherein one of said first and second 
20 nucleic acids is linked to a nucleic acid encoding a DNA binding domain, and the other 

of said first and second nucleic acids is linked to a nucleic acid encoding a transcription- 
activation domain, whereby two fusion proteins can be produced in said host cell. 



32, The host cell of Claim 31, further comprising a reporter gene, wherein the 
25 expression of the reporter gene is determined by the interaction between the first protein 

and the second protein. 

33. A method for providing a compound capable of interfering with an 
interaction between the first and second proteins in the protein complex according to any 

30 one of Claims 1-13, comprising: 
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providing atomic coordinates defining a three-dimensional structure of said 
protein complex; and 

designing or selecting compounds capable of interfering with the interaction 
between said first protein and said second protein based on said atomic coordinates. 

34. A method for selecting a compound capable of inhibiting a protein-protein 
interaction between TsglOl and HIV GAGp6, comprising: 

contacting a test compound with a protein selected from group consisting of 

(i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
90% identical to that of TsglOl and capable of interacting with HIV GAGp6, 

(iii) a TsglOl protein fragment containing the TsglOl UEV dom a i n , and 

(iv) a fusion protein containing said TsglOl protein, said TsglOl protein 
homologue or said TsglOl protein fragment; and 

determining whether said test compound is capable of binding said protein. 

35. The method of Claim 34, further comprising testing a test compound 
capable of binding said protein for its ability to interfere with a protein-protein interaction 
between TsglOl and HIV GAGp6. 

36. The method of Claim 34 or 35, further comprising testing a test compound 
capable of binding said protein for its ability to inhibit HIV viral budding from an HIV- 
infected host cell. 

37. A method for selecting a compound capable of inhibiting a protein-protein 
interaction between TsglOl and HTV GAGp6, comprising! 

providing atomic coordinates defining a three-dimensional structure of a protein 
selected from group consisting of 

(i) TsglOl protein, 

(ii) a TsglOl protein homologue having an amino acid sequence at least 
90% identical to that of TsglOl and capable of interacting with HIV GAGp6, 
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(iii) a TsglOl protein fragment containing the TsglOl UEV domain, and 

(iv) a fusion protein containing said TsglOl protein, said TsglOl protein 
homologue or said TsglOl protein fragment; and 

designing or selecting compounds capable of interacting with said protein based 
5 on said atomic coordinates. 

38. The method of Claim 37, further comprising testing a compound capable 
of interacting with said protein for its ability to interfere with a protein-protein interaction 
between TsglOl and HIV GAGp6. 

10 

39. Hie method of Claim 37 or 38, further comprising testing a test compound 
capable of interacting with said protein for its ability to inhibit HIV viral budding from an 
HIV-infected host cell. 

15 40. An isolated antibody selectively immunoreactive with a protein complex 

according to any of Claims 1-13. 

41. A method for modulating, in a cell, a protein complex having a first 
protein which is TsglOl interacting with a second protein which is HIV GAG, said 

20 method comprising: 

reducing the concentration of said protein complex in the cell. 

42. Hie method of Claim 41, wherein said reducing step comprising 
interfering with an interaction between said first protein and said second protein. 

25 

43. The method of Claim 42, wherein said reducing step comprises 
administering to the cell a compound capable of interfering with an interaction between 
said first protein and said second protein. 

30 44. The method of Claim 43, wherein said compound is capable of binding 

TsglOl. 
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45. The method of Claim 43, wherein said compound is capable of binding the 
UEV domain of TsglOl protein. 

5 46. Hie method of Claim 43, wherein said compound is an antibody 

immunoreactive with TsglOl . 

47. The method of Claim 43, wherein said compound is a nucleic acid 
encoding an antibody immunoreactive with TsglOl . 

10 

48. The method of Claim 47, wherein said antibody is a single-chain antibody. 

49. The method of Claim 43, wherein said compound is an antibody 
selectively immunoreactive with said protein complex. 

15 

50. The method of Claim 49, wherein said compound is a nucleic acid 
encoding an antibody selectively immunoreactive with said protein complex. 

5 1 . Hie method of Claim 50, wherein said antibody is a single chain antibody. 

20 

52. The method of Claim 41, wherein said reducing step comprises reducing 
the concentration of TsglOl in the cell. 

53. The method of Claim 52, wherein said step of reducing the concentration 
25 of TsglOl in the cell comprises administering to the cell an an ti sense compound 

specifically hybridizing to a TsglOl nucleic acid. 

54. The method of Claim 52, wherein said step of reducing the concentration 
of TsglOl in the cell comprises administering to the cell a ribozyme compound 

30 specifically hybridizing to a TsglOl nucleic acid. 
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55. A method for inhibiting HIV viral budding from a host cell, comprising: 
interfering with an interaction between TsglOl and HIV GAG in the host cell. 

56. Hie method of Claim 55, wherein said interfering step comprises 
administering to die host cell a compound capable of interfering with the interaction 
between TsglOl and HIV GAG. 

57. The method of Claim 55, wherein said interfering step comprises 
administering to the host cell a compound capable of binding TsglOl protein. 

58. The method of Claim 57, wherein said compound is an antibody 
immnnoreactive with TsglOl. 

59. The method of Claim 57, wherein said compound is an antibody 
immunoreactive with TsglOl, said antibody be capable of binding the UEV domain of 
TsglOl. 

60. A method for inhibiting HIV viral budding from a host cell, comprising: 
administering to the host cell an antibody selectively immunoreactive with a 

protein complex comprising TsglOl interacting with HIV GAGp6. 

61 . A method for inhibiting HIV viral budding from a host cell, comprising: 
administering to the host cell a nucleic acid encoding an antibody immunoreactive 

with TsglOl. 

62. The method of Claim 61, wherein said antibody is a single chain antibody. 

63. A method for inhibiting HIV viral budding from a host cell, comprising: 
administering to the host cell a nucleic acid encoding an antibody selectively 

immunoreactive with a protein complex comprising TsglOl interacting with HIV 
GAGp6. 
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64. Hie method of Claim 63, wherein said antibody is a single chain antibody. 

65. Use of an antisense compound specifically hybridizing to a TsglOl 
nucleic acid in inhibiting HIV budding, treating HTV infection or preventing AIDS. 

66. Use of a ribozyme compound specifically hybridizing to a TsglOl nucleic 
acid in inhibiting HIV budding, treating HTV infection or preventing AIDS. 

67. Use of a compound capable of binding TsglOl protein in inhibiting HIV 
budding, treating HTV infection or preventing AIDS. 

68. Use of an antibody immunoreactive with TsglOl in inhibiting HTV 
budding, treating HIV infection or preventing AIDS. 

69. Use of an antibody capable of binding the UEV domain of TsglOl in 
inhibiting HIV budding, treating HIV infection or preventing AIDS. 

70. Use of a compound capable of interfering with the interaction between 
TsglOl and HIV GAG in inhibiting HIV budding, treating HIV infection or preventing 
AIDS. 

71. Use of an antibody selectively immunoreactive with a protein complex 
comprising TsglOl interacting with HIV GAG in inhibiting HIV budding, treating HIV 
infection or preventing AIDS. 

72. Use of a nucleic acid encoding an antibody immunoreactive with TsglOl 
in inhibiting HIV budding, treating HIV infection or preventing AIDS. 
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73. Use of a nucleic acid encoding an antibody selectively immunoreactive 
with a protein complex comprising TsglOl interacting with HIV GAG in inhibiting HIV 
budding, treating HIV infection or preventing AIDS. 
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<150> US 09/971,549 
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<160> 32 

<170> Patentln version 3.1 

<210> 1 

<211> 4 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 1 

Pro Thr Ala Pro 
1 

Page 1 



SUBSTITUTE SHEET (RULE 26) 



Wd 02/072790 



PCT/US02/08146 



<210> 



2 



<211> 



4 



<212> 



PRT 



<213> 



Mutant human immunodeficiency virus 



<400> 



2 



Pro Thr Thr Pro 
1 



<210> 3 

<211> 4 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 3 

Pro Ser Ala Pro 



<210> 4 

<211> 4 

<212> PRT 

<213> Mutant human immunodeficiency virus 

<400> 4 

Pro Ser Thr Pro 



<210> 5 

<211> 4 

<212> PRT 

<213> Mutant human immunodeficiency virus 

<400> 5 

Pro He Ala Pro 



1 



1 



1 



<210> 6 



<211> 4 



Page 2 



SUBSTITUTE SHEET (RULE 26) 



W(V 02/072790 PCT/US02/08146 
<212> PRT 

<213> Mutant human immunodeficiency virus 

<400> 6 

Pro lie Thr Pro 
1 

<210> 7 
<211> 5 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 7 

Glu Pro Thr Ala Pro 
1 5 

<210> 8 

<211> 5 

<212> PRT 

<213> Human iraniunode f i c i ency virus 

<4b0> 8 

Glu Pro Ser Ala Pro 
1 5 

<210> 9 

<211> 5 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 9 

Pro Thr Ala Pro Pro 
1 5 

<210> 10 
<211> 5 
<212> PRT 

<213> Human immunodeficiency virus 
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<400> 10 

Pro Ser Ala Pro Pro 
1 5 



<210> 11 

<211> 6 

<212> PRT 

<213> Human immunodeficiency virus 

<4Q0> 11 

Glu Pro Thr Ala Pro Pro 
1 5 

<210> 12 

<211> 6 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 12 

Glu Pro Ser Ala Pro Pro 
1 5 

<210> 13 

<211> 6 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 13 

Pro Glu Pro Thr Ala Pro 
1 5 

<210> 14 
<211> 6 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 14 

Pro Glu Pro Ser Ala Pro 
1 5 
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<210> 15 

<211> 7 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 15 

Arg Pro Glu Pro Thr Ala Pro 
1 5 

<210> 16 

<211> 7 

<212> PRT 

<213> Human immunodeficiency viruB 



<400> 16 

Arg Pro Glu Pro Ser Ala Pro 
1 5 



<210> 17 

<211> 7 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 17 

Pro Glu Pro Thr Ala Pro Pro 
1 5 

<210> 18 

<211> 7 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 18 

Pro Glu Pro Ser Ala Pro Pro 
1 5 

<210> 19 

<211> 8 
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<212> PRT 

<213> Human immunodeficiency virus 
<400> 19 

Glu Pro Thr Ala Pro Pro Glu Glu 
1 5 

<210> 20 

<211> 8 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 20 

Glu Pro Ser Ala Pro Pro Glu Glu 
1 5 

<210> 21 

<211> 8 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 21 

Glu Pro Thr Ala Pro Pro Ala Glu 
1 5 

<210> 22 

<211> 9 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 22 

Pro Glu Pro Thr Ala Pro Pro Glu Glu 
1 5 

<210> 23 
<211> 9 
<212> PRT 

<213> Human immunodeficiency virus 
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<400> 23 



Pro Glu Pro Thr Ala Pro Pro Ala Glu 
1 5 



<210> 24 

<211> B 

<212> PRT 

<213> Human immunodeficiency virus 

<400> 24 

Pro Glu Pro Ser Ala Pro Pro Glu Glu 

1 5 

<210> 25 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 25 

Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu 
15 10 

<210> 26 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 26 

Arg Pro Glu Pro Ser Ala Pro Pro Glu Glu 
15 10 

<210> 27 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 27 

Arg Pro Glu Pro Thr Ala Pro Pro Ala Glu 
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<210> 28 
<211> 10 
<212> PRT 

<213> Human immunodeficiency virus 
<400> * 28 

Arg Pro Glu Pro Ser Ala Pro Pro Ala Glu 
15 10 

<210> 29 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 29 

Leu Gin Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu 
15 10 

<210> 30 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 30 

lieu Gin Ser Arg Pro Glu Pro Ser Ala Pro Pro Glu Glu 
15 10 

<210> 31 
<211> 14 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 31 

Leu Gin Ser Arg Pro Glu Pro Thr Ala Pro Pro Glu Glu Ser 
15 10 

<210> 32 
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<211> 14 
<212> PRT 

<213> Human immunodef iciency virus 
<400> 32 

Leu Gin Ser Arg Pro Glu Pro Ser Ala Pro Pro Glu Glu Ser 
15 10 
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